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Field of the Invention 

The present invention relates generally to canine 
coronavirus infections, and specifically to proteins useful 
in prophylaxis, therapy, and diagnosis of these infections 
15 in canines. 

Background of the Invention 

The coronaviruses are a large family of mammalian 
and avian pathogens which were first described in 1968. 
They are the causative agents of several diseases including 

20 encephalitis, hepatitis, peritonitis and gastroenteritis. 

Enteric coronaviruses have been detected in the feces of 
man, pigs, calves, cats, mice, chickens and dogs. 

Canine coronavirus (CCV) enteritis was first 
isolated from dogs suffering an acute gastroenteritis, as 

15 reported by Binn et al., Proc. 78t h Ann. Mtq. U.5. Animal 

Health Assoc. . Roanoke VA, pp. 359-3 66 (1974) . The disease 
became prevalent during the 1970s. CCV gastroenteritis 
appears to be primarily transmitted through fecal 
contamination from infected dogs via the oral route, 
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leading ultimately to replication of tlie virus in the 
epithelial cells of the small intestine. Virus can be 
recovered from the feces of an infected dog between 3 and 
14 days after infection. 
5 CCV gastroenteritis is characterized by a mild 

depression , anorexia and loose stool from which the dog 
usually recovers. The onset of the disease is often 
sudden, accompanied by such symptoms as diarrhea, vomiting, 
excreted blood in stools, and dehydration. Deaths have 
10 occurred within as little as 24 to 3 6 hours after onset of 

clinical signs. Most dogs appear afebrile but elevated 
body temperature is seen in some cases. Often CCV will 
occur with a canine parvovirus infection and this 
coinfection can be fatal. 
15 Serologically the disease is closely related to 

transmissible gastroenteritis virus of swine (TGEV) . 
Although canine coronavirus does not infect pigs, 
transmissible gastroenteritis virus produces a subclinical 
infection in dogs. However, unlike the feline infectious 
20 peritonitis coronavirus (FIPV) , previous exposure to CCV 

does not predispose dogs to enhanced disease/ and antigen- 
antibody complexes r if formed, are not associated with 
disease pathology. 

There remains a need in the art for compositions 
25 useful in diagnosing, treating and preventing infections 

with canine coronaviruses- 

Summary of the Invention 

In one aspect the present invention provides the 
complete nucleotide sequence of the CCV s gene, strain 1- 
71 r SEQ ID N0:l. The S gene or fragments thereof may be 
useful in diagnostic compositions for CCV infection* 

In another aspect the present invention provides 
a ccv s (or spike) protein characterized by the amino acid 
seguence of a CCV S protein r SEQ ID NO: 2, and peptide 
fragments thereof. *Ehese proteins may be optionally fused 
or linked to other fusion proteins or molecules. 

XXID <WO 932342aA 1_l_> 
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Thus, in another aspect, the present invention 
v provides a vaccine composition containing an effective 

immunogenic amount of at least one CCV S protein or an 
immunogenic fragment thereof. 
5 In still another aspect, the invention provides 

a method of vaccinating an animal against infection with a 
coronavirus by administering an effective amount of a 
vaccine composition of this invention. 

In yet a further aspect, the present invention 
10 provides a pharmaceutical composition for the treatment of 

CCV infection comprising a therapeutically effective amount, 
of a CCV S peptide or protein of the invention and a~ 
pharmaceutically effective carrier. 

Still another aspect of this invention is an 
15 antibody directed to CCV, which antibody is capable of 

distinguishing between CCV and other canine viruses. These 
antibodies may also be employed as diagnostic or 
therapeutic reagents. 

in yet another aspect, a diagnostic reagent of 
20 the present invention comprises a CCV s protein or fragment 

thereof. In another aspect, the present invention provides 
a diagnostic reagent which comprises a nucleotide sequence 
which encodes a CCV S protein or fragment of the invention, 
and/ or a nucleotide sequence which flanks the coding 
25 region, or fragments thereof. These protein and nucleotide 

sequences are optionally associated with detectable labels. 
Such diagnostic reagents may be used to assay for the 
presence of CCV in dogs using standard assay formats . and 
can form components of a diagnostic kit. 
* 3 0 In a further aspect, the invention provides a 

method of using a diagnostic reagent of this invention to 
identify dogs which are uninfected or which have been 
previously exposed to CCV. The diagnostic method can 
differentiate exposure to CCV from exposure to other 
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related coronaviruses , allow the identification of dogs 
which have been vaccinated against these diseases, and 
allow one to distinguish between different strains of CCV, 
or to identify dogs at advanced stages of CCV infection. 
5 In yet a further aspect, the invention provides 

a method for the production of a recombinant CCV protein 
comprising culturing a selected host cell, e.g., a 
mammalian cell or viral vector, transformed with a DNA 
sequence encoding a selected CCV S protein or fragment 

10 thereof in operative association with regulatory sequences 

capable of regulating the expression of said protein. 

Another aspect of the invention is a recombinant 
DNA molecule comprising a DBA sequence coding for a 
selected portion of a canine coronavirus s protein, the DNA 

15 sequences in operative association with regulatory 

sequences capable of directing the expression thereof in 



invention are described further in the following detailed 
20 description of the preferred embodiments thereof. 

Detail ed Description of the Invention 

The present invention provides novel isolated 
canine coronavirus (CCV) s proteins and fragments thereof, 
as well as isolated nucleotide 

25 proteins or fragments. These proteins and 

useful for diagnostic, vaccinal and therapeutic 
compositions as well as methods for using these 
compositions in the diagnosis, prophylaxis and treatment of 
CCV-relatiea and other coronavirus -related conditions. 

3 0 X. Definitions 

As defined herein, an amino acid fragment is any 
amino acid sequence from at least about a amino acids in 
length up to about the full-length CCV $ gene protein. A 
nucleotide fragment defines a nucleotide sequence which 
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encodes from at least: about 8 amino acids in length up to 
about the full-length CCV S gene protein. 

The term "region" refers to all or a portion of 
a gene or protein, which may contain one or more fragments 
5 as defined above. 

The term "immunogenic" refers to any S gene 
protein or fragment thereof, any molecule 7 protein, 
peptide, carbohydrate, virus, region or portion thereof 
which is capable of eliciting a protective immune response 
10 in a host, e.g., an animal, into which it is introduced. 0 

The term "antigenic" refers only to the ability 
of a molecule, protein, peptide, carbohydrate, virus, 
region or portion thereof to elicit antibody formation^ ib^ : 
a host (not necessarily protective) • 
15 As used herein, the term "epitope" refers to a 

region of a protein which is involved in its 
immunogenicity, and can include regions which induce B cell 
and/or T cell responses. 

As used herein, the term "B cell site or T cell 
20 site" defines a region of the protein which is a site for 
B cell or T cell binding. Preferably this term refers to 
sites which are involved in the immunogenicity of the 
protein. 

JT. Sources of CCV Sequences 

25 The examples below specifically refer to newly 

identified spike gene sequences from canine coronavirus 
(CCV) strain 1-71. This strain is deposited with the 
American Type Culture Collection (ATCC) , 12301 Parklawn 
Drive, Rockville, Maryland under Accession No. VR-809. 

30 Particularly disclosed are nucleotide and amino acid 
sequences, SEQ ID N0:1 and 2, respectively, of the CCV S 
gene. 
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The present invention is not limited to the 
particular CCV strain employed in the examples, other CCV 
strains have been described, e.g., strain CCV-TN449 [ATCC 
2068]. Utilizing the teachings of this invention, 
5 analogous fragments of other canine coronavirus strains can 

be identified and used in the compositions of this 
invention. 

XJjT. CCV Nucleotide and Amino Acid Sequences of the 

Invention - • 

10 The inventors have identified and selected 

nucleotide and protein sequences of CCV strain 1-71 which 
have been determined, to be of interest for use as vaccinal, 
therapeutic and/ or diagnostic compositions. For example, 
selected peptide and nucleotide sequences present primarily 
in the variable N terminal region of the CCV S protein and 
gene are characterized by representing areas of homology 
between FIFV, TGEV, feline enteric coronavirus (FECV) and 



Peptide fragments obtained from this 
20 heterogeneous N" terioinal of the S protein are useful 

fragments for diagnostic compositions and kits for 
distinguishing between infection with CCV strain 1-71 from 
other CCV infections, and for distinguishing between 
infection with CCV and other coronavirus identified above 
25 in a vaccinated or infected dog, as well as for use in 

vaccine and therapeutic agents; 

Additionally,- the amino terminal sequences of CCV 
S protein include peptide sequences which are B cell sites 
and thus useful in vaccinal or therapeutic compositions, or 
3 0 for generating antibodies to CCV, in assays for the 

detection of CCV antibodies in dogs. 

Xn addition, certain peptide fragments of the CCV 
S protein are believed to represent T cell sites, and thus 
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Other suitable CCV amino acid regions for 
pharmaceutical or diagnostic use are located within other 
regions of the CCV S protein SEQ ID NO: 2. These amino 
acid and nucleotide fragments of the CCV S protein and its 
5 nucleotide sequence discussed above are specifically 

reported below in Tables I and II. Table II also reports 
the respective homologies of certain of these desired 

■ 

fragments to wild-type FIPV, i.e., FIPV WSU 1146. The CCV 
S nucleotide fragments in Tables I and II can be useful for 
10 diagnostic probes, PCR primers, or for use in recombinant 

production of relevant S protein fragments for use in 
therapeutic or vaccinal compositions. Other suitable 
fragments may also be identified for such use. 
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Table I 




ccv Amino 


Acids 




B cell sites 


T cell sites 


SEO ID NOS: 


50-250 




3 


375-425 




4 


450-470 




5 


550-600 




6 


650-700 




7 


770-850 




8 


900-1025 




9 


1150-1225 




10 


1250-1452 




11 




40-47 


12 




63-81 


13 




187-191 


14 




241-274 


15 




335-341 


16 




395-428 


17 




468-494 


18 




846-860 


19 




916-952 


20 




977-992 


21 




1068-1145 


22 




1366-1391 


23 
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Table II 







Amino 


Acid Seouences 




5 


CCV 

Amino Acid 


1-71 
Nucleotides 


% Homology CCV 1-71 

to WT FIPV WST7 114 6 


SEQ ID NOS. 




1113-1236 


3337-3708 


100 






540-599 


1618-1797 


93 . 3 






-J *t ^ -J o o 




93. 6 


29 and 28 




137-153 


409-459 


64.7 


31 and 30 


10 


375-388 


1123-1164 


85.7 


33 and 32 




1424-1440 


4270-4320 


94.1 


35 and 34 




14 07-1420 


4219-4260 


85.7 


37 and 36 




1342-1406 


4024-4218 


96.9 


39 and 38 




398-652 


1192-1956 


93.3 


41 and 40 


15 


128-555 


382-1665 


89.5 


43 and 42 




447-628 


1339-1884 


91.8 


45 and 44 



Modified Sequences of tie Invention. 

In addition to the amino acid sequences and 

2 0 corresponding nucleotide sequences of the specifically- 

recited embodiments of CCV S proteins of this invention, 
the invention also encompasses other DNA and amino acid 
sequences of CCV S proteins. such other nucleic acid 
sequences include those sequences capable of hybridizing to 
25 SEQ ID NO: 1 under conditions of at least 85% stringency, 

i.e. having at least 85% hpmology to the sequence of SEQ ID 
NO: l, more preferably at least 90% homology, and most 
preferably at least -95% homology. Such homologous 
sequences are characterized by encoding a CCV S gene 

3 0 protein related to strain 1-71. 
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Further, allelic variations (naturally-occurring 
base changes in the species population which may or may not 
result in an amino acid change) of DNA sequences encoding 
the various S amino acid or DNA sequences from the 
5 illustrated CCV are also included in the present invention, 

as well as analogs or derivatives thereof. Similarly, DNA 
sequences which code for protein sequences of the invention 
but which differ in codon sequence due to the degeneracies 
of the genetic code or variations in the DNA sequence 

10 encoding these proteins which are caused by point mutations 

or by induced modifications to enhance the activity, half^ 
life or production of the peptide encoded thereby are also 
encompassed in the invention. 

Variations in the amino acid sequences of this 

15 invention may typically include analogs that differ by only 
1 to about 4 codon changes. Other examples of analogs 
include polypeptides with minor amino acid variations from 
the natural amino acid sequence of S gene proteins and/or 
the fusion partner; in particular, conservative amino acid 

20 replacements. Conservative replacements are those that 
take place within a family of amino acids that are related 
in their side chains. Genetically encoded amino acids are 
generally divided into four families: (1) acidic = 
aspartate, glutamate; (2) basic = lysine, arginine, 

25 histidine; (3) non-polar = alanine, valine f leucine, 

isoleucine, proline, phenylalanine, methionine, tryptophan; 
and (4) uncharged polar = glycine, asparagine, glutamine, 
cysteine, serine, threonine, tyrosine. Phenylalanine, 
tryptophan, and tyrosine are sometimes classified jointly 

30 as aromatic amino acids. For example, it is reasonable to 

expect that an isolated replacement of a leucine with an 
isoleucine or valine, an aspartate with a glutamate, a 
threonine with a serine, or a similar conservative 
replacement of an amino acid with a structurally related 
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amino acid will not have a significant effect on its 
activity, especially if the replacement does not involve an 
amino acid at an epitope of the polypeptides of this 
invention. 
5 V. Fusion Proteins + 

If desired, the CCV s proteins and peptide 
fragments, e.g. those identified in Tables I and II, can be 
produced in the form of fusion proteins as defined below. 
Such a fusion protein may contain either a full-length CCV 
10 S protein or an immunogenic fragment thereof. Suitable 

fragments include those contained within SEQ ID NO: 2 and 
the amino acids fragments of Tables I and II. Other 
suitable fragments can be determined by one of skill in the 
art by analogy to the sequences provided herein. 
15 Proteins or peptides may be selected to form 

• fusion proteins with the . selected S protein or peptide 
sequence based on a number of considerations. The fusion 
partner may be a preferred signal sequence, a sequence 
which is characterized by enhanced secretion in a selected 
20 host cell system, or a sequence which enhances the 

stability or presentation of the S^derived peptide. Such 
exemplary fusion partners include, without limitation, 
ubiquitin and a mating factor for yeast expression systems, 
and beta-galactosidase and influenza NS-l protein for 
bacterial systems. . One of skill in the art can readily 



expression system. The present invention is not limited to 
the use of any particular fusion partner. 

The CCV S protein or fragments thereof can 
30 optionally be fused to each other or to the fusion partner 

through a conventional linker sequence, i.e., containing 
about 2 to 50 amino acids, and more preferably, about 2 to 
about 20 amino acids in length. This optional linker may 
provide space between the two linked sequences. 
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Alternatively, this linker sequence may encode, if desired, 
a polypeptide which is selectively cleavable or digestible 
by conventional chemical or enzymatic methods. For 
example, the selected cleavage site may be an enzymatic 
5 cleavage site, including sites for cleavage by a 
proteolytic enzyme, such as enter ©kinase, factor Xa, 
trypsin, collagenase and thrombin. Alternatively, the 
cleavage site in the linker may be a site capable of being 
cleaved upon exposure to a selected chemical, e.g., 

10 cyanogen bromide or hydroxy lamine. The cleavage site, if 

inserted into a linker useful in the fused sequences .. of 
this invention, does not limit this invention. Any desired 
cleavage site, of which many are known in the art, may^be* 
used for this purpose. 

15 VT. Production of Sequences of Invention 

The CCV S gene protein of the invention and amino 
acid regions, fragments thereof and their corresponding 
nucleotide sequences, as well as other proteins described 
herein, e.g. fusion partners, may be produced by^ 

20 conventional methods. These proteins or fragments and the. 
nucleotide sequences may be prepared by chemical synthesis 
techniques [Merrifield, J.A.C.S. , 85:2149-2154 (1963)]. 
Preferably, however, they are prepared by known recombinant 
DNA techniques by cloning and expressing within a host 

25 microorganism or cell a DNA fragment carrying, a coding 
sequence for the selected protein. See, e.g., Sambrook et 
al, "Molecular Cloning. A Laboratory Manual", 2nd edit., 
Cold Spring Harbor Laboratory, New York (1989) . Such 
techniques are discussed below in the Examples. 

30 According to cloning techniques, a selected gene 

fragment of this invention can be cloned into a selected 
expression vector. Vectors for use in the method of 
producing S protein proteins comprise a novel S gene DNA 
sequence (or a fragment thereof) of the invention and 
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selected regulatory sequences in operative association with 
the DNA coding sequence, and capable of directing the 
replication and expression of the peptide in a selected 
host cell. 

5 Vectors, e.g. , polynucleotide molecules, of the 

invention may be designed for expression of CCV s proteins 
and/or fusion proteins in bacterial, mammalian, fungal or 
insect cells or in selected viruses, suitable vectors are 
known to one skilled in the art by resort to known 
10 publications or suppliers. 

The resulting DNA molecules or vectors containing 
nucleotide sequences encoding the canine coronavirus s 
peptides or fragments thereof and/ or encoding the fusion 
proteins are then introduced into host cells and expression 

15 of the heterologous protein induced. 

Additional expression systems may include the 
known viral expression systems, e.g., vaccinia , fowlpox, 
swine pox. It is understood additionally, that the design 
of the expression vector will depend on the choice of host 

20 cell. A variety of suitable expression systems in any of 

the below-identified host cells are known to those skilled 
in the art and may be. readily selected without undue 



Suitable cells or cell lines for use in 
25 expressing the S protein or peptides of this invention can 

be eukaryotic or prokaryotic A preferred expression 
system includes mammalian cells, such as Chinese Hamster 
ovary cells (CHO) or COS-l cells. The selection of other 
suitable mammalian host cells and methods for 
3 0 transformation, culture, amplification, screening and 

product production and purification are known in the ait. 
See, e.g., Gethirig ^n<i Sambrook, Nature , 293 :620-625 
(1981) , or alternatively, Kaufman et al, Mol. Cell. Biol. . 



XJCID: <WO_9323423A1J_> 



<• *\ WO 93/23423 PCT/US93/04692 



13 

5(7) : 1750^1759 (1985) or Howley et al, U. S. Patent 
4,419,446. Also desirable are insect cell systems, such as 
the baculovirus or Drosophila systems. The selection of 
other suitable host cells and methods for transformation, 
5 culture, amplification, screening and product production 

and purification can be performed by one of skill in the 
art by reference to known techniques. See, e.g., Gething 
and Sambrook, Nature , 293 : 620-625 (1981). 

After the transformed host cells are 

10 conventionally cultured for suitable times and under 

suitable culture conditions known to those skilled in the 
art, the cells may be lysed. It may also be possible, 
depending on the construct employed, that the recombina^bv*: 
proteins are secreted extracellularly and obtained from the: 

15 culture medium. Cell lysates or culture medium are then 

screened for the presence of CCV S protein or peptide which 
are recognized by antibodies, preferably monoclonal 
antibodies (MAbs) , to a peptide antigenic site from CCV. 

Similarly, the fusion proteins may be produced by 

20 resort to chemical synthesis techniques , or preferably-/' 
recombinant methods , as described above. The selected 
primer sets used in the PCR reaction described in the 
Examples below may be designed to produce PCR amplified 
fragments containing restriction endonuclease cleavage site 

25 sequences for introduction of a canine coronavirus S gene 

fragment in a specific orientation into a selected 
expression vector to produce fusion proteins of the 
invention. The vector may contain a desired protein or 
fragment thereof to which the S gene fragment is fused in 

30 frame to produce a fusion protein. 

The crude cell lysates containing the CCV S 
protein or peptides or fusion proteins can be used directly 
as vaccinal components, therapeutic compositions or 
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diagnostic reagents. Alternatively , the CCV s peptides can 
be purified from the crude lysate or medium by conventional 
means . 

VXX. Vaccine Compositions 

5 The CCV S proteins and immunogenic fragments of 

this invention may be incorporated in a vaccine 
composition. Such a vaccine composition may contain an 
immunogenic amount of one or more selected CCV s peptides 
or proteins, e.g., encoded by the complete S gene sequence 
10 of CCV or partial sequences thereof, and prepared according 

to the method of the present invention, together with a 
carrier suitable for administration as a vaccine 
composition for prophylactic treatment of CCV infections. 
The protein may be in the form of a fusion protein as 
15 above-described » Alternatively, the CCV S gene or fragment 

may be incorporated into a live vector, e-g., adenovirus, 
vaccinia virus and the like. The expression of vaccinal 
proteins in such live vectors are well-known to those in 
the art [See, e.g., U. S. Patent No. 4,920,209]. It is 
20 preferable, that the protein employed in the vaccine 

composition induces protective immune responses against 
more than one strain of CCV* 

A vaccine composition according to the invention 
may optionally contain other immunogenic components. 
25 Particularly desirable are vaccine compositions containing 
other canine antigens , e.g., canine distemper, Borrelia 
burgdorferi, canine Bordetella, rabies, canine parvovirus, 
Leptosporidia sp. r canine rotavirus, canine parainfluenza 
virus and canine adenovirus. 
30 In another embodiment, the CCV S proteins may be 

used in a combination vaccine directed to related 
coronaviruses. Other suitable coronaviruses which can be 
used in such a combination vaccine include a feline 
coronavirus, such as FIPV or FECV. For example, a CCV S 
35 peptide or protein of the present invention may be employed 
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as an additional antigen in the temperature sensitive FIPV 
vaccine described in detail in co-owned, co-pending U. S. 
Patent Application Ser. No. 07/42 8,796 filed October 30, 
1989, incorporated by reference herein. Alternatively, the 

5 CCV S protein or peptide or a fragment thereof could be 

used in a vaccine composition containing other coronavirus 
S proteins or fragments thereof, particularly those 
described in co-pending, co-owned U.S. Patent application 
Ser. No. 07/698,927 (and its corresponding published PCT 

.0 Application No. WO92/08487) . 

The preparation of a pharmaceutical^ acceptable 
vaccine composition, having appropriate pH isotonicity, 
stability and other conventional characteristics is withH^ 
the skill of the art. Thus such vaccines may optimally 

.5 contain other conventional components, such as adjuvants 

and/ or carriers, e.g. aqueous suspensions of aluminum and 
magnesium hydroxides, liposomes and the like. 

The vaccine composition may be employed to 
vaccinate animals against the clinical symptoms associated 

0 with ccv. The vaccines according to the present invention- 

can be administered by an appropriate route, e.g., by 'the 
oral, intranasal, subcutaneous, intraperitoneal or 
intramuscular routes. The presently preferred methods of 
administration are the subcutaneous and intranasal routes. 

5 The amount of the CCV S peptide or protein of the 

invention present in each vaccine dose is selected with 
regard to consideration of the animal f s age, weight, sex, 
general physical condition and the like. The amount 
required to induce an immunopr ot ect i ve response in the 

0 animal without significant adverse side effects may vary 

depending upon the recombinant protein employed as 
immunogen and the optional presence of an adjuvant. 
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Generally,, it is expected that each dose will comprise 
between about 0*05-5000 micrograms of protein per mL, and 
preferably 0- 05-100 micrograms per mli of a sterile solution 
of an immunogenic amount of a protein or peptide of this 
invention* Initial doses may be optionally followed by 
repeated boosts , where desirable. 

. Another vaccine agent of the present invention is 
an anti-sense RNA sequence generated to the S gene of CCV 
strain 1-71 [SEQ ID NO:l] [S. T. Crooke et al, Biotech. , 
10:8 82-8 8 6 (Aug. 1992)]. This sequence may easily be 
generated by one of skill in the art either synthetically 
or recombinantly. Under appropriate delivery, such an 
anti-sense RNA sequence when administered v to an' infected 
animal should be capable of binding to the RNA of the 
virus, thereby preventing viral replication in the cell. 
VIII, Pharmacetrbiaal Compositions 

The invention also provides a pharmaceutical 
composition comprising one or more CCV S peptides or 
proteins prepared according to the present invention and a 
P^™ acell 'fcAcally effective carries:. Suitable 
phannaceutically effective carriers for internal 
administration are known to those skilled in the art. One 
selected carrier is sterile saline* The pharmaceutical 
composition can be adapted for administration by any 
appropriate route, but is designed preferentially for 
administration by injection or intranasal administration. 
I*Z. Antibodies of the Invention 



development of an antibody to 
above identified amino acid 
s protein, which epitope is 
CCV strains or other coronaviruses 
FECV. The antibody pan be 
antigenic substance, a peptide 



the 

or more epitopes in the 
derived from the CCV 
from those of other 
, e.g. FIPV, TGEV or 
employing as an 
I or II. 
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Alternatively , other regions of the CCV strain 1-71 S 
protein SEQ ID NO: 2 may be employed in the development of 
an antibody according to conventional techniques. 

In one embodiment, the antibody is capable of 
5 identifying or binding to a CCV antigenic site encoded by 

SEQ ID NO; 1 or a fragment thereof. such an antibody may 
be used in a diagnostic screening test, e.g., as a 
hybridization probe, or as a therapeutic agent. 

Antibodies which bind CCV peptides from the 
10 regions identified above or to other regions capable, of 

distinguishing between CCV, TGEV, FIPV, FECV, and other 
coronaviruses for use in the assays of this invention may 
be polyclonal. However, it is desirable for piirposesvofe 
increased target specificity to utilize MAbs, -both in the 
15 assays of this invention and as potential therapeutic and 

prophylactic agents. Additionally, synthetically designed 
MAbs may be made by known genetic engineering techniques 
[W. D. Euse et al, Science . 246 ; 1275-1281 (1989)] and 
employed in the methods described herein. For purposes of 

2 0 simplicity the term MAb(s) will be used throughout this, 

specification; however, it should be understood that 
certain polyclonal antibodies, particularly high titer 
polyclonal antibodies and recombinant antibodies, may also 
be employed. 

25 A MAb may be generated by the well-known Kohler 

and Milstein techniques and modifications thereof and 
directed to one or more of the amino acid residue regions 
identified above, or to other CCV S peptides' or epitopes 
containing differences between CCV strain 1-71 and other 

3 0 coronaviruses. For example, a fragment of SEQ ID NO: 2 

which represents an antigenic site, which differs from that 
of FIPV, may be presented as an antigen in conventional 
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techniques for developing MAbs. One of skill in the art 
may generate any number of MAbs by using fragments of the 
amino acid residue regions identified herein as an 
immunogen and employing these teachings. 
5 For diagnostic purposes, the antibodies (as well 

as the diagnostic probes) may be associated with individual 
labels. Where more than one antibody is employed in a 
diagnostic method f the labels are desirably interactive to 
produce a detectable signal. Most desirably, the label is 
10 detectable visually, e.g. colorimetrically. Detectable 

labels for attachment to antibodies useful in the 
diagnostic assays of this invention may also be easily 
selected by one skilled in the art of diagnostic assays, 
amont which include, without limitation, -horseradish 
15 peroxidase (HEP) or alkaline phosphatase (AP) , hexokinase 

in conjunction with glucose-6-phosphate dehydrogenase, and 
NAD oxidoreductase with lucif erase and substrates NADH and 
FMN or peroxidase with luminol and substrate peroxide. 
These and other appropriate label systems and methods for 
20 coupling -them to antibodies or peptides are known to those 

of skill in the art. 

Antibodies may also be used therapeutically as 
targeting agents to deliver virus-toxic or infected cell- 
toxic agents to infected cells. Rather than being 

25 associated with labels for diagnostic uses, a therapeutic 
agent employs the antibody linked to an agent or ligand 
capable of disabling the replicating mechanism of the virus 
or of destroying the virally-inf ected cell. The identity 
of the toxic ligand does not limit the present invention. 

3 0 It is expected that preferred antibodies to peptides 

encoded by the S ; genes identified herein may be screened 
for the ability to internalize into the infected cell and 
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X* Diagnostic Reagents and Assays 

The nucleotide sequences, amino acid fragments 
and antibodies described above may be employed as 
diagnostic reagents for use in a variety of diagnostic 
5 methods according to this invention. 

PCR Diagnostic Assays « 
For example, these sequences can be utilized in 

* 

a diagnostic method employing the polymerase chain reaction 
(PCR) technique to identify the presence of a CCV or CCV- 

10 like virus and in therapy of infected animals. 

In addition to those sequences identif ied above^, 
the oligonucleotide sequences that were designed to prime 
cDNA synthesis at specific sites within the CCV S genevas 
described in detail below in Example 3 [SEQ ID NO: 46-50] ; 

15 may also be employed as diagnostic reagents according to 

this invention. These sequences, as well as the below^ 
described optimized conditions for the PCR amplification of 
CCV fragments therefrom, may also be employed in a 
diagnostic method. 

The PCR technique is known to those of skill; in 
the art of genetic engineering and is described in detail 
in Example 4 [see, e.g., R* K. Saiki et al, Science , 
£30:1350-1354 (1985)], which is incorporated herein by 
reference. Briefly described, PCR employs two 

25 oligonucleotide primers which are complementary to the 

opposite strands of a double stranded nucleic acid of 
interest whose strands are oriented such that when they are 
extended by DNA polymerase, synthesis occurs across the 
region which separates the oligonucleotides. By repeated 
cycles of heat denaturation, annealing of the primers to 
their complementary sequences and extension of the annealed 
primers with a temperature stable DNA polymerase, millions 
of copies of the target gene sequence are generated. The 
template for the reaction is total RNA, which is isolated 



20 



30 
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from CCV infected cells* DNA fragments generated by PCR 
were amplified from cDNA which had been synthesized from 
this RNA. Other strains of CCV or CCV-related sequences 
may also provide PCR templates in a similar manner • 
5 In one diagnostic method , for example , 

heterogenous CCV gene sequences of this invention are 
useful as reagents in diagnostic assays to detect and 
distinguish the presence of specific viruses from each 
other, e.g-. , to distinguish one canine coronavirus strain 

10 from another or one species of coronavirus from another by 

means of conventional assay formats. For example , using 
protocols similar to those used for forensic purposes , 
tissue or blood samples from a dog suspected to be infected 
with CCV would be subjected to PCR amplification with a 

15 selected CCV-specific set of primers r such as those DNA 

sequences disclosed herein. Amplification of DNA from a 
sample tissue or biological fluid of the animal suspected 
of infection using nucleotide sequences as primers specific 
for regions of the CCV viral gene sequences could correlate 

20 to the presence of CCV. Absence of ccv in the sample would 

result in no amplification. Similarly, the selection of 
specific sets of s gene primers would allow the 
identification of a particular strain of ccv as well. 
Thus, appropriate treatments may be selected for the 

25 infected animal. 

Example 3 provides oligonucleotide primers which 
permitted the synthesis of regions of the CCV S gene. The 
nucleotide sequence of the S gene of CCV provides desirable 



3 0 - example, the sequences between nucleotide base pairs 900 to 
about 1600 [SEQ ID HO: 55] and about 2500 to about 3900 
[SEQ ID NO: 56] of SEQ ID NO: 1. Smaller or larger DNA 
fragments in these regions may also be employed as PCR 
primers or hybridization probes. 
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It is desirable to have PGR primer sequences 
between 15 to 30 bases in length f with an intervening 
sequence of at least 100 bases to as large as 5000 bases 
there between, according to conventional PCR technology. 
5 However, it is possible that larger or smaller sequence 
lengths may be useful based upon modifications to the PCR 
technology • In general, in order to achieve satisfactory 
discrimination, a hybridization or oligonucleotide probe 
made up of one or more of these sequences would consist of 
10 between 15 and 50 bases in length based on current 

technology. 

B . Conventional Assay Formats 

The CCV S proteins or peptide fragments may alao* 
be employed in standard diagnostic assays which rely on S 

15 protein immunogens as targets for sera recognition. The 
diagnostic assays may be any conventionally employed assay;; 
e.g. , a sandwich ELISA assay, a Western blot, a Southern 
blot and the like. Because a wide variety of diagnostic 
methods exist and are conventionally known which can be 

20 adapted to the use of the nucleotide and amino acid- 

sequences described herein, it should be understood that 
the nature of the diagnostic assay does not limit the use 
of the sequences of. this invention. 

For example, the amino acid sequences encoded by 

25 CCV S gene sequences, such as those appearing in Tables I 
and II above, which may be amplified by PCR, provide 
peptides useful in such diagnostic assays as ELISA or 
Western assay, or as antigens for the screening of sera or 
development of antibodies. 

3 0 For example , the sequences between abouir amino 

acid 1 to about 250 [SEQ ID N0:57], about 450 to about 650 
[SEQ ID NO: 58], and about 900 to about 1150 [SEQ ID NO: 59] 
of the CCV strain 1-71 S gene protein SEQ ID NO: 2, are 
anticipated to be useful as such antigens. Such peptides 

35 can optionally also be used in the design of synthetic 
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peptide coupled to a carrier for diagnostic uses, e.g. , 

f 

antibody detection in sera. Suitable carriers include 
ovalbumin, keyhole limpet hemocyanin, bovine serum albumin, 
sepharose beads and polydextran beads. 
5 Such peptide antigens and antibodies to these 

peptides would react positively with tissue or serum 
samples of dogs infected with CCV, but negatively with non- 
CCV infected dogs. These antibodies are discussed in more 
detail below. 

10 For example, the invention provides a method of 

using the full length CCV S protein or fragments thereof as 
diagnostic agents for identifying the presence or absence 
of antibodies in previously ..exposed, naive : or vaccinated 
dogs, respectively, as well as for differentiating exposure 

15 to CCV- from other, related coronaviruses . Other S peptides 

or fusion proteins which show differential reactivity to 
CCV and other coronavirus sera may also be useful as CCV- 
specific reagents in EIiISA-based screening assays to detect 
CCV exposure in dogs. Similarly , an S protein or peptide 

20 which contains epitopes recognized only by sera from CCV 

infected dogs or by sera from CCV positive dogs could be 
employed to distinguish or differentiate among coronavirus 
infections . 

As one assay format, the reactivity of affinity 
25 purified CCV S proteins or peptides fragments to canine 
biological fluids or cells can be assayed by. Western blot. 
The assay is preferably employed on sera, but may also be 
adapted to be performed on other appropriate fluids or 
cells , for example, macrophages or white blood cells. In 
30 the Western blot technique, the purified protein * separated * 

by a preparative SDS polyacrylamide gel, is . transferred to 

3- 
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nitrocellulose and cut into multiple strips. The strips 
are then probed with dog sera from uninfected or infected 
dogs. Binding of the dog sera to the protein is detected 
by incubation with alkaline phosphatase tagged goat anti- 
5 dog XgG followed by the enzyme substrate BCIP/NBT. Color 
development is stopped by washing the strip in water. 

ccv S protein or fragments thereof may also be 
used in an ELISA based assay for detecting CCV disease. A 
typical ELISA protocol would involve the adherence of 

10 antigen (e.g., a S protein) to the well of a 96-well tray. 

The serum to be tested is then added. If the serum 
contains antibody to the antigen, it will bind. 
Specificity of the reaction is determined by the antigens: 
absorbed to the plate. With the S protein, only sera from- 

15 those dogs infected with CCV would bind to the plate; sera 
from naive or uninfected dogs would not bind. 

Similarly, a CCV S protein or peptide which 
contained epitopes recognized only by sera from CCV- 
infected dogs or by sera from CCV-positive dogs could be 

2 0 employed to distinguish coronavirus infections. After, the.: 

primary antibody is bound, an enzyme-labeled antibody 
directed against the globulin of the animal whose serum is 
tested is added. Substrate is then added. The enzyme 
linked to antibody bound to the well will convert the 
25 substrate to a visible form. The amount of color measured 

is proportional to the amount of antibody in the test 
material. In this manner, dogs infected with CCV can be 
identified and treated, or dogs naive to the virus can be 
protected by vaccination. 

3 0 When used as diagnostic reagents, the primers, 

probes, peptide antigens, nucleotide sequence encoding or* 
flanking a CCV S protein or fragment of the invention, and 
antibodies of this invention may be optionally associated 
with detectable labels or label systems known to those 
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skilled in the art. Such labelled diagnostic reagents may 
be used to assay for the presence of CCV in dogs in 
hybridization assays or in the PGR technique as described 
above . 

5 Cj. Diagnostic Kits 

The assay methods, PCR primers , CCV S nucleotide 
sequences [SEQ ID NO:l], S proteins and peptides, and 
antibodies described herein may be efficiently utilized in 
the assembly of a diagnostic kit, which may. be used by 

10 veterinarians or laboratories* The kit is useful in 

distinguishing between CCV infected animals and vaccinated 
animals, as well as non-exposed dogs, and between CCV- 
infected animals and animals infected with serologically 
related viruses, such as other CCV or FIPV, TGEV, and FECV. 

15 Such a diagnostic kit contains the components necessary to 



Thus, the kit may contain a sufficient amount of 
at least one CCV S protein, fusion protein or peptide 
fragment, at least one CCV S gene nucleotide sequence or 
20 PCR primer pair of this invention, a MAb directed to a 

first epitope on the CCV S protein (which Mb may be 
labeled) , optional additional components of a detectable 
labelling system* vials for containing the serum samples, 
protein samples and the like, and a second MAb conjugated 
25 to the second enzyme,- which in proximity tq the first 

enzyme, produces a visible product. Other conventional 
components of such diagnostic kits may also be included. 

Alternatively, a kit may contain a selected CCV 
S protein or peptide, a Jtab directed against a selected CCV 
30 s peptide fragment bound to a solid surface and associated 

with a first enzyme, a different mh associated with a 
second enzyme, and a sufficient amount of the substrate for 
the first enzyme, which, when. added to the serum and. MAbs, 
provides the reactant for the second enzyme, resulting in 
35 the color change. 
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Other known assay formats will indicate the 
inclusion of additional components for a diagnostic 3cit 
according to this invention. 

The following examples illustrate the embodiments 
5 of this invention and do not limit the scope of the present 
invention. 

Example 1 - Isolation of CCV 

Canine coronavirus strain 1-71 was isolated in 
1971 from military dogs suffering from a viral 
10 gastroenteritis by Binn et al. , Proceeding 78th Annual 

Meeting U.S. Animal Health Association , October 1974, p. 

3 59-366. The initial isolate from the feces of* : *th# 
infected dog was grown in tissue culture on the PrDKTCA72 
dog cell line [ATCC No. CRL 1542]. The coronavirus strain 

15 used in this study was received from the ATCC (ATCC 

#VR-809, CCV Strain 1-71, Frozen lot#4, Passage 7/PDK, 17 
May 1988) and passaged five times on PrDKTCA72. 

Example 2 - RNA purification 

After the fifth passage the infected cells were 

20 processed for RNA isolation by infecting a 1700 cc 2 roller 
bottle with a CCV inoculum. The inoculum was prepared by 
diluting 2,5 pi of infected fluids from a confluent 
monolayer into 13.0 mis of media. One ml of this material 
was used to infect a roller bottle and the cells were grown 

25 until they demonstrated a pronounced cytopathic effect at 

4 8 hours. The infected monolayers were harvested and total 
cytoplasmic HNA was extracted using the guanidinium 
thiocyanate procedure as described in Chirgwin et al., 
Biochem. , 18:5294 (1979). 
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Example 3 - Primers Used fo r PCR Amolif ication of CCV Spike 
Gene Fragments 

The primers appearing- below in Table III were 
synthesized conventionally by the phosphoramidite method 
5 and gel purified prior to use. Primer #3 045 was based on 

an FECV S gene sequence; and primers #492 0, 1923, 2443 and 
2600 were based on WT FIPV WSU 1146 sequences. 



Table III 

1 0 Amp 1 if i ed Cloned Top Bottom 

S Gene Region Region Primer Primer 



1-362 aa 1-352 aa # 3045 # 4920 

352-1452 aa 352-1452 aa # 2600 # 1923 

15 1-555 aa 128-555 aa # 3045 # 24-43 

* 

Primer # DNTA Sequence 

1923 TAAATATCSCXTTAGTGGACATGCACTTTTTCAATTGG 
[SEQ ID NO: 46] Stul 

20 2443 TTAGTAGGCCTGTCGAGGCTATGGGTTGACCATAACCAC 

[SEQ XD NO: 47] Stul 

2 6 0 0 CAGATCCCGGGTGTACAATCTGGTATGGGTGCTACAG 
[SEQ ID NO: 48] Xmal 

3 045 GTGCCCCCGGGTATGATTGTGCTCGTAACTTGCCTCTTG 
25 [SEQ ID NO: 49] XmaX 

4 9 2 0 AGCACCCATACCAGATTGTACA TCTGCAGT GAAATTAAGATTG 
[SEQ ID NO: 50] Pstl 



30 



Example 4 - PCR Amplification of CCV S Gene 

PCR amplified fragments of CCV S gene were 
generated using the following procedure. All PCR reagents 
were supplied by Perkin Elmer-Cetus, NorwaUc, CT. In a 
final reaction, volume of 20 pi of IX HI buffer (5X RT 
buffer: 250 mM Tris-HCl, pH 8.3, 375 mM KC1, 15 mM MgCl 2 ) , 
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the following components were assembled in RNAse-free 
siliconized 500 /il microcentrifuge tubes: 1.0 mM of each 
dNTP, 20 units of RNAsin [Promega Corp, Madison, WI], 2.5 
picomoles of random hexamer oligonucleotides [Pharmacia, 
5 Milwaukee, WI] , 100 picomoles//il solution in TE buffer (10 
mM Tris-HCl, 1 mM EDTA, pH 7.5) , 200 units of reverse 
transcriptase [Superscript RT, Bethesda Research Labs, 
Gaithersburg, MD] and 1.0 /ig of respective RNA isolated as 
described above in Example 3. To avoid pipetting errors 

10 and contamination, all solutions were aliguoted from master 

mixes made with diethyl pyrocarbonate (DEPC) treated water 
and consisted of all of the reaction components except the 
RNA which was added last. 

The mixture was incubated in a programmable 

15 thermal cycler [Perkin-Elmer Cetus, Norwalk, CT] at 21 °C 
for ten minutes followed by 42 °C for one hour then 95°C for 
five minutes and finally held at 4°C until PCR 



Amplification of the cDNA was performed 
20 essentially according to the method of R. K. Saiki et al, 

Science , 230:1350-1354 (1985) using the Tag polymerase. 
Briefly, to the 20 jx\ cDNA reaction mix from above was 
added 10.0 /il 10X PCR buffer, 1.0 /ul of each upstream and 
downstream primer previously diluted in water to 3 0 
25 picomoles per microliter and 2.5 units of Taq polymerase 
(Perkin-Elmer Cetus, Norwalk, CT) . Final volume was made 
up to 100 /tl using DEPC treated water and overlaid with 10 0 
Ml of mineral oil. As above, master mixes were prepared to 
avoid contamination. The reaction was performed in the 
3 0 Perkin-Elmer Cetus thermal cycler for one cycle by 

denaturing at 95 °c for 1 minute, annealing at 37 °C for 3 
minutes followed by an extension at 72 °C for 40 minutes. 
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This initial cycle increased the likelihood of first strand 
DNA synthesis . A standard PCR profile was then performed 
by a 95 °c for 1 minute denaturation, 37 °C for 3 minutes 
annealing, 72 °C for 3 minutes extension for 40 cycles. A 
5 final extension cycle was done by 95 °C for 1 minute 

denaturation , 37 °C for 2 minutes annealing, 72 °c for 15 
minutes extension and held at 4°C until analyzed. 

PGR products were analyzed by ele'ctrophoresing 
5.0 ill of the reaction on a 1.2% agarose gel for 16-17 

10 hours. Bands were visualized by ethidium bromide staining 

the gel and fluorescence by UV irradiation at 256 nm. 
Photography using Polaroid type 55 film provided a negative 
that could be digitized for sample distance migration and 
comparison against markers run on each gel. The actual 

15 sizes of the bands were then calculated using the Beckman 

* 

Microgenie software running on an IBM AT. 



5 - Cl oning of CCV Spike Gene Regions 

Cloning procedures were performed substantially 
as described by Kaniatis efc al , cited above* Details of 
20 the clonings are provided in the following examples* Calf- 

alkaline phosphatase was from Bethesda Research Labs 
(Gaithersburg, MD) . Ligation products were transformed 
into JS. coll host strain XL1 Blue [Stratagene Cloning 
Systems, La Jolla, CA]. pBluescript SK^tl3-phagemid vector 
25 was also obtained "from Stratagene Cloning Systems. All 

restriction enzymes were purchased from New England Biolabs 
(Beverly, MA) or Bethesda Research Labs (Gaithersburg, MD) 
and used according to manufacturer's specifications. T4 
DNA ligase was received from Boehrihger Mannheim 
30 Bibchemicals (Indianapolis, lifj . calf intestinal alkaline 

phosphatase was purchased from Bethesda Research Labs . 
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Example 6 - ccv s Protein Fragment. A. A. 1-128 rSEO ID 
NO: 511 

Five microliters (approximately 200 ng) of PCR- 
amplified DNA representing amino acids 1-3 62 [SEQ ID NO: 53] 
5 of the CCV spike gene were ligated to the pT7Blue T-Vector 
(Novagen, Madison, WI) as per the manufacturer's 
instructions, one microliter of the ligation mix was used 
to transform NovaBlue competent cells (Novagen) and 
transformation mixes were plated on LB plates supplemented 

10 with ampicillin, isopropylthio-j8-galactoside (IPTG; Sigma 

Chemical Co., St. Louis, MO), and 5<-bromo-4-chloro-3;r 
indolyl-/3-D-galactoside (X-gal; Sigma Chemical Co,, St. 
Louis, MO). White colonies were picked and screened:, ,by£, 
restriction analysis of mini-prep DNA. Insert-bearing 

15 clones were identified and oriented with respect to vector 

•by Sma l/ Pst I, Stul, and PstI digests. Clone #2 964 
contained a full-length 1-362 amino acid insert and was 
used to provide sequence analysis from 1-128 amino acids of 
the CCV s gene* 

20 

Example 7 - CCV S Protein Fragment. A. A. 128-555 rSEOoED 
NO:431 

10 fil of PCR DNA encoding l-555aa of the CCV 
spike protein was digested with Sma l/ Stu I for 4 hours at 

25 room temperature. DNA bands were isolated and purified 
from low-melting temperature agarose gels as described by 
Maniatis et al, cited above. Briefly, DNA fragments were 
visualized after staining with ethidium bromide, excised 
from the gel with a scalpel and transferred to microfuge 

30 tubes. Gel slices were incubated 5 min at 65 °C, vortexed, 
and 5 volumes of 20 mM Tris, pH 8.0, 1 mM EDTA were added. 
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Samples were incubated an additional 2 minutes at 65 °C and 
were then extracted once with phenol and again with 
phenol: chloroform. The DNA was precipitated with 1/10 
volume 3 M: NaOAc , pH 7.0, and 2.5 volumes of cold 95% EtOH 
5 overnight at -20°C. Insert DNAs were ligated to SK n M13- 

sxnaX-digested, dephosphorylated vector [Stratagene] for 4 
hours at room temperature* Insert-bearing clones were 
identified by Xho I/ SstI and Bal l digests of mini-prep DNA. 
Restriction enzyme and sequence analysis indicated that the 
10 cloned insert was short by ~300bp due to the presence of a 

StuI site at amino acid #128 of the CCV spike gene. 
Therefore, these clones contained the CCV s protein 
spanning amino acids from about 128-555 [SEQ ID N0r43]. 

Example 8 - CCV' S Protein Fragment r A. A. 352-1452 TSEO ID 
15 NO r 52 1 

PCR-^amplif ied DNA fragments encoding amino acids 
352-1454 of the CCV spike protein were purified using 
Prime-Erase Quik Columns [Stratagene] according to the 
manufacturer's instructions. Column-pur if ied DNAs were 

20 then digested, with Xmal^EcoR V overnight at 15 °C and 

subsequently isolated and eluted from low-smelting 
temperature agarose gels as described by Maniatis et al, 
cited above. inserts were ligated overnight at 15 °C to 
SK n Ml3- IQnal /StuX digested, dephosphorylated vector 

25 [Stratagene] . Clones were identified and oriented with 

respect to vector by XhoX / Sst X and PvuX X digests of 
mini-prep DNAs, respectively. 

Example 9 - DNA Sequencing 

DNA sequence for the CCV S gene was determined 
30 from the individual clones #1775 (AA 352-1452; SEQ ID 

NO*52), #2007 (AA 128-555; SEQ ID NO:43) and #2964 (AA 1- 

352; SEQ XD Ko:53) . -Nested set deletions were prepared 
from each clone or internal primers synthesized to 
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facilitate primer walking and the sequence determined from 
both strands [Lark Sequencing Technologies, Houston, TX] . 
The chain termination method performed as described in 
Sanger et al, Proc. Natl. Acad. Sci. USA , 74:5463-5467 
5 (1977) was used to determine the sequence of all clones. 

The full length sequence of the CCV S gene was assembled 
from overlapping sequences of each of the three separate 

* 

fragments by computer analysis. 

DNA sequence analysis was performed using either 

10 Beckman Microgenie programs on an IBM Model PS/2 Model 70 

or the University of Wisconsin GCG package of programs 
implemented on a DEC VAX cluster [Devereau et al-, (1984)]. 

SEQ ID N0:l is the complete nucleotide sequence 
of the CCV strain 1-71 S gene. The amino acid [SEQ ID~ 

15 NO: 2] and nucleotide sequences [SEQ ID NO : 1 ] of CCV 1-71 
total 1452 amino acids and 4356 base pairs. CCV 1-71 has 
a DNA homology of 90.8% to published FIpV strain WT WSU 
1146 , 93.2% identity with FIPV strain DF2 and 94.1% 
similarity with FECV. In comparison to WSU 1146 , this CCV 

20 strain further contains two amino acid deletions at. 

positions 11 and 12, and two amino acid insertions * at 
positions 118 and 119. In comparison to the amino acid 
sequences of other coronavirus s genes, the amino acid 
sequence of CCV is 82.2% homologous to TGEV, 89.7% 

25 homologous to DF2-HP, 90.0% homologous to TS-BP, 92.9% 

homologous to TS, 93*2% homologous to DF2, and 94.1% 
homologous to FECV • 

The canine coronavirus S gene encoding amino 
acids #22 5-13 25 [SEQ ID NO: 54] has an overall homology to 

30 the published WT FIPV WSU 1146 strain at amino acids 352 to 

1454 of 95.9%. The homology level is increased to 97.5% 
when the comparison is done under the amino acid similarity 
rules as proposed by M. o. Dayhoff , Atlas of Protein 
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Sequence and Structure, Vol. 5, Supp. 3, Natl. Biomed. Res. 
Found., Washington, DC (1978) . There are 42 amino acid 
differences between the CCV S gene and the published 
sequence of WSU 1146 strain within the CCV sequence of SEQ 
ID NO: 2. Other CCV fragment homologies with WT FIPV WSU 
1146 are illustrated in Table II above. 

Numerous modifications and variations of the 
present invention are included in the above-identified 
specification and are expected to be obvious to one of 
sfcill in the art. Such modifications and alterations to 
the compositions and processes of the present invention are 
believed to be encompassed in the scope of the claims 
appended hereto. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: Miller, Timothy J. 

Klepfer r Sharon 
Reed, Albert Paul 
Jones, Elaine V. 

(ii) TITLE OF INVENTION: Canine Coronavirus S Gene and Dses 
Therefor 

(iii) NUMBER OF SEQUENCES: 59 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation - Corporate 

Patents 

(B) STREET: 709 Swedeland Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) 2IP: 19406-2799 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC— DOS /MS— DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1-25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/880,194 

(B) FILING DATE: 08-MAY-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/698,927 

(B) FILING DATE: 13-MAY-1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/613,066 

(B) FILING DATE: 14-NOV-1990 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Schreck, Patrica A. 

(B) REGISTRATION NUMBER: 33,777 

(C) REFERENCE /DOCKET NUMBER: SBC H85010-1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE i (215) 270-5015 

(B) TELEFAX: (215) 270-5090 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: DNA {genomic) 

(ix) FEATURE: 

(A) NAME/KEZt CDS 

(B) LOCATION; 1..4356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l; 

ATG ATT GTG CTC GTA ACT TGC CTC TTG TTT TCG TAC AAT AGT GTG ATT 
Met He Val Leu Val Thr Cys Leu Leu Phe Ser Tyr Asn Sex Val lie 
1 5 10 15 

TGT ACA TCA AAC AAT GAC TGT GTA CAA GTT AAT GTG ACA CAA TTG CCT 
cys Thr ser Asn Asn Asp cys Val Gin Val Asn Val Thr Gin Leu Pro 

20 25 30 



GGC AAT GAA AAC ATT ATT AAA GAT TTT CTA TTT CAC ACC TTC AAA GAA 
Gly Asn Glu Asn lie Tie Lys Asp Phe Leu Phe His Thr Phe Lye Glu 
35 40 



48 



96 



144 



GAA GGA AGT GTA GTT GTT GGT GGT TAT TAC CCT ACA GAG GTG TGG TAT 
Glu Gly Ser Val Val Val Gly Gly Tyr Tyr Pro Thr Glu Val Trp Tyr 
50 .55 60 



AAC TGC TCC AGA AGC GCA ACA ACC ACC GCT TAC AAG GAT TTT AGT AAT 
Asn Cys Ser Arg Ser Ala Thr Thr Thr Ala Tyr Lys Asp Phe Ser Asn 

70 75 ~ 80 



ATA CAT GCA TTC TAT TTT GAT ATG GAA GCC ATG GAG 

lie His Ala Phe Tyr Phe Asp Met Glu Ala Met Glu 

85 90 



AGT ACT GGC 

ser Thr Gly 
95 



CTA GTA CAT GTT CAT GGT GAT CCT GTT 
Leu Val His Val His Gly Asp Pro Val 

110 



TAT AGA GAT GAT GTG CAA GGA AGG 
Ala Tyr Arg Asp Asp Val Glh Gly Arg 
120 



AAT GCA CGA GGT AAA CCT 
Asn Ala Arg Gly Lys Pro 

100 

AGT ATC ATC ATA TAT ATA 
Ser lie lie lie Tyr lie 
115 

CCT CTT TTA AAA CAT GGT TTG TTG TGT ATA ACT AAA AAT AAA ATC ATT 
Pro Leu Leu Lys His Gly Leu Leu Cys lie Thr Lys Asn Lys lie He 
130 135 140 

AGT GCC ATA TGT TTG GGT 
Ser Ala He Cys Leu Gly 

160 



GAC TAT AAC ACG TTT ACC AGC GCA GAG 
Asp Tyr Asn Thr Phe Thr Ser Ala Gin 
145 150 



GAT GAC AGA AAA ATA CCA TTC TCT GTC ATA CCC ACA GGT AAT GCT ACA 
Asp Asp Arg Lys lie Pro Phe Ser Val lie Pro Thr Gly Asn Gly Thr 

165 170 175 

AAA ATA TTT GGT CTT GAG TGG AAT GAT GAC TAT GTT ACA GCC TAT ATT 

Lys He Phe Gly Leu Glu Trp Asn Asp Asp Tyr Val Thr Ala Tyr He 

180 185 190 

AGT GAT CGT TCT CAC CAT TTG AAC ATC AAT AAT AAT TGG TTT AAC AAT 
Ser Asp Arg Ser His His Leu Asn He Asn Asn Asn Trp Phe Asn Asn 
195 200 2Q5 

GTG ACA ATC CTA TAC TCT CGA TCA AGC ACT GCT ACG TGG CAG AAG AGT 

Val Thr He Leu Tyr Ser Arg Ser ser Thr Ala Thr Trp Gin Lys Ser 
210 215 220 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 
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GCT GCA TAT GTT TAT CAA GGT GTT TCA AAT TTT ACT TAT TAC AAG TTA 720 

Ala Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys Leu 

225 230 235 240 

AAT AAC ACC AAT GGC TTG AAA AGC TAT GAA TTG TGT GAA GAT TAT GAA 768 

Asn Asn Thr Asn Gly Leu Lye Ser Tyr Glu Leu Cya Glu Asp Tyr Glu 

245 250 255 

TGC TGC ACT GGC TAT GCT ACC AAC GTA TTT GCC CCG ACA GTG GGC GGT 816 

Cys Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Val Gly Gly 

260 265 270 

TAT ATA CCT GAT GGC TTC AGT TTT AAC AAT TGG TTT ATG CTT ACA AAC 864 

Tyr lie Pro Asp Gly Phe Ser Phe Asn Asn Trp Phe Met Leu Thr Asn 
275 280 285 

AGT TCC ACG TTT GTT AGT GGC AGA TTT GTA ACA AAT CAA CCA TTA TTG 912 

Ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gin Pro Leu Leu 
290 295 300 

GTT AAT TGT TTG TGG CCA GTG CCC AGT CTT GGT GTC GCA GCA CAA GAA 960 

Val Asn Cys Leu Trp Pro Val Pro Ser Leu Gly Val Ala Ala Gin Glu 

305 ~ 310 315 320 

TTT TGT TTT GAA GGT GCG GAG TTT AGC CAA TGT AAT GGT GTG TCT TTA 1008 

Phe Cys Phe Glu Gly Ala Gin Phe Ser Gin Cys Asn Gly Val Ser Leu 

325 330 335 

AAC AAT ACA GTG GAT GTC ATT AGA TTC AAC CTT AAT TTT ACC ACA GAT 1056 

Asn Asn Thr Val Asp Val lie Arg Phe Asn Leu Asn Phe Thr Thr Asp 

340 345 350 

GTA CAA TCT GGT ATG GGT GCT ACA GTA TTT TCA CTG AAT ACA ACA GGT 1104 

Val Gin Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly 
355 360 365 

GGT GTC ATT CTT GAG ATT TCT TGT TAT AAT GAT ACA GTG AGT GAG TCA 1152 

Gly Val lie Leu Glu lie Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser 
370 375 380 

AGT TTC TAC AGT TAT GGT GAA ATT TCA TTC GGC GTA ACT GAT GGA CCG 1200 

Ser Phe Tyr Ser Tyr Gly Glu lie Ser Phe Gly Val Thr Asp Gly Pro 

385 390 395 400 

CGT TAC TGT TAC GCA GTC TAT AAT GGC ACG GCT CTT AAG TAT TTA GGA 1248 

Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu Gly 

405 410 415 

ACA TTA CCA CCT AGT GTC AAG GAA ATT GCT ATT AGT AAG TGG GGC CAT 1296 

Thr Leu Pro Pro Ser Val Lys Glu lie Ala lie Ser Lys Trp Gly His 

420 425 430 

TTT TAT ATT AAT GGT TAC AAT TTC TTT AGC ACT TTT CCT ATT GAT TGT 1344 

Phe Tyr He Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro lie Asp Cys 
435 440 445 

ATA TCT TTT AAT TTA ACC ACT GGT GAT AGT GGA GCA TTT TGG ACA ATT 1392 

lie ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr lie 
450 455 460 

GCT TAC ACA TCG TAC ACT GAC GCA TTA GTA CAA GTT GAA AAC ACA GCT 1440 

Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu Asn Thr Ala 

465 470 475 480 
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ATT AAA AAG GTG ACG TAT TGT AAC AGT CAC ATT AAT AAC ATT AAA TGT 
lie Lys Lys Val Thr Tyr Cys Abu Ser His lie Abu Asn He I,ys Cys 

485 490 495 

TCT CAA CTT ACT GCT AAT TTG CAA AAT GGA TTT TAT CCT GTT GCT TCA 
Ser Gin Leu Thr Ala Aon Leu Gin Asn Gly Phe Tyr Pro Val Ala Ser 

500 505 510 

AGT GAA GTT GGT CTT GTC AAT AAG AGT GTT GTG TTA CTA CCT AGT TTC 

Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe 
515 52G 



TAT TCA CAT ACC AGT GTT AAT ATA 'ACT ATT GAT CTT GGT ATG AAG CGT 
Tyr Ser His Thr Ser Val Asn He Thr lie Asp Leu Gly Met Lys Arg 
530 535 540 

AGT GGT TAT GGT CAA CCC ATA GCC TCA ACA TTA AGT AAC ATC ACA CTA 
Ser Gly Tyr Gly Gin Pro Xle Ala Ser Thr Leu Ser Aen He Thr Leu 
545 550 555 560 



CCA ATG CAG GAT AAT AAC ACC GAT GTG TAC TGC ATT CGT TCT AAC CAA 

Pro Met Gin Asp Asn Asn Thr Asp Val Tyr Cys He Arg Ser Asn Gin 

570 



GAT 
Asp 



GTG TTT AAT TCC GAC TGC ACA GAT GTT TTA TAT GCT ACA GCT GTT ATA 
Val Phe Asn Ser Asp Cys Thr Asp Val Leu Tyr Ala Thr Ala Val Xle 
595 600 



TTT TCA GTT TAC GTT CAT TCC ACT TGT AAA AGT TCT TTA TGG 
Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp 

5S0 585 590 



AAA ACT GGT ACT TGT CCT TTC TCG TTT GAT AAA TTG AAC AAT TAC TTA 
Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu 
610 615. 620 

TGC 

cys 
640 

AAG TTT GAT GTT GCC GCT CGT ACA AGA ACC AAT GAG CAG GTT GTT AGA 
Lys Phe Asp Val Ala Ala Arg Thr Arg Thr Asn Glu Gin Val Val Arg 

645 650 



ACT TTT AAC AAG TTC TGT TTG TCA TTG AAT CCT GTT GGT GCC 
Thr Phe Asn Lys Phe Cys Leu Ser Leu Aen Pro Val Gly Ala 
625 630 



AGT TTA TAT GTA ATA TAT GAA GAA GGA GAC AAC ATA GTG GGT GTG CCG 

Ser Leu Tyr Val lie Tyr Glu Glu Gly Asp Asn lie Val Giy Val Pro 

660 665 670 

GTG CTA CAC TTA GAC TCC 
Val Leu His Leu Asp Ser 



TCT GAC AAT AGT GGT CTT CAC GAC TTG TCA 
Ser Asp Asn Ser Gly Leu His Asp Leu Ser 
675 680 



TGT ACA GAT TAT AAT ATA. TAT GGT AGA 
Cys Thr Asp Tyr Asn lie Tyr Gly Arg 
690 



ACT GGT GTT GGT ATT ATT AGA 
Gly Val Gly He He Arg 
700 



CAA ACT AAC AGT ACG CTA CTT 
Gin Thr Asn Ser Thr Leu Leu 
705 710 

GGT GAC TTG TTA GGG TTT AAA 
Gly Asp Leu Leu Gly Phe Lys 

725 



AGT GGC TTA TAT 



GTT AGT 
Val Ser 
730 



TAC ACA TCA CTA TCA 

Tyr Thr Ser Leu Ser 

720 

GGT GTC ATC TAT TCT 
Gly Val He Tyr Ser 



1488 



1536 



1584 



1632 



1680 



1728 



177 6 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 
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GTC ACG CCA TGT GAT GTA AGC GGA CAA GCT GCT GTT ATT GAT GGC GCC 2256 

Val Thr Pro cya Asp Val ser Ala Gin Ala Ala Val lie Asp Gly Ala 

740 745 750 

ATA GTT GGA GCT ATG ACT TCC ATT AAT AGT GAA ATG TTA GGT CTA ACA 2304 
He Val Gly Ala Met Thr Ser He Asn Ser Glu Met Leu Gly Leu Thr 
755 760 765 

CAT TGG ACA ACA ACA CCT AAT TTT TAT TAT TAT TCT ATA TAT AAT TAT 2352 
His Trp Thr Thr Thr Pro Asn Phe Tyr Tyr Tyr Ser lie Tyr Asn Tyr 
770 775 780 

ACC AAT GAA AGG ACT CGT GGC ACA' GCA ATT GAT AGT AAC GAT GTT GAT 2400 
Thr Asn Glu Arg Thr Arg Gly Thr Ala lie Asp Ser Asn Asp Val Asp 
785 790 795 800 

TGT GAA CCT ATC ATA ACC TAT TCT AAT ATA GGT GTT TGT AAA AAT GGA 2448 

Cys Glu Pro He He Thr Tyr Ser Asn He Gly Val Cya Lye ABn Gly 

805 810 815 

GCT TTG GTT TTT ATT AAC GTC ACA CAT TCT GAT GGA GAC GTT CAA CCA 2496 
Ala Leu Val Phe He Asn Val Thr Hie Ser Asp Gly Asp Val Gin Pro 

820 825 830 

ATT AGC ACC GGT AAT GTC ACG ATA CCT ACA AAT TTT ACC ATA TCT G^G 2544 
He Ser Thr Gly Asn Val Thr lie Pro Thr Asn Phe Thr He Ser Val 
835 840 645 

CAA GTT GAG TAC ATT CAG GTT TAC ACT ACA CCG GTG TCA ATA GAT TGT 2592 
Gin Val Glu Tyr He Gin Val Tyr Thr Thr Pro Val Ser He Asp Cys 
850 855 860 

TCA AGG TAC GTT TGC AAT GGT AAC CCT AGA TGC AAT AAA TTG TTA ACG 2640 
Ser Arg Tyr Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr 
865 870 875 *~ 880 

CAA TAC GTT TCT GCA TGT CAA ACT ATT GAG CAA GCA CTT GCA ATG GGT 2688 ' 

Gin Tyr Val Ser Ala CyB Gin Thr He Glu Gin Ala Leu Ala Met Gly 

885 890 895 

GCC AGA CTT GAA AAC ATG GAG ATT GAT TCC ATG TTG TTT GTT TCG GAA 2736 

Ala Arg Leu Glu Asn Met Glu He Asp Ser Met Leu Phe Val Ser Glu 

900 905 910 

AAT GCC CTT AAA TTG GCA TCT GTT GAA GCA TTC AAT AGT ACG GAA ACT 2784 
Asn Ala Leu Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu Thr 
915 920 925 

TTA GAT CCT ATT TAC AAA GAA TGG CCT AAC ATT GGT GGT TCT TGG CTA 2832 
Leu Asp PrD He Tyr Lys Glu Trp Pro Abii He Gly Gly Ser Trp Leu 
930 935 940 

GGA GGT TTA AAA GAC ATA TTG CCA TCT CAC AAC AGC AAA CGT AAG TAC 2880 
Gly Gly Leu Lys Asp He Leu Pro Ser His Asn Ser Lys Arg Lys Tyr 
945 950 955 960 

CGG TCG GCT ATA GAA GAT TTG CTT TTT GAT AAG GTT GTA ACA TCT GGC 2928 
Arg Ser Ala He Glu Asp Leu Leu Pho Asp Lys Val Val Thr Ser Gly 

965 970 975 

TTA GGT ACA GTT GAT GAA GAT TAT AAA CGT TGT ACA GGT GGT TAT GAC 2976 
Leu Gly Thr Val Asp Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr Asp 

980 985 * 990 



JDOCID: <WO_8323423A1J_> 



WO 93/23423 



PCI7US93/04692 



38 



ATA GCT GAC TTA 6TG TGT GCA CAA TAT TAC AAT 6GC ATC ATG GTO CTA 
lie Ala Asp Leu Val Cys Ala Gin Tyr Tyr Asn Gly lie Met Val Leu 
995 iOOO 1005 



3024 



CCT GGT GTA GCT AAT GAT GAC AAG ATG GCT ATG TAC ACT GCA TCT CTT 
Pro Gly Val Ala Asn Asp Asp Lys Met Ala Met Tyr Thr Ala Ser Leu 
1010 1015 1020 



3072 



GCA GGT GGT ATA ACA TTA GGT GCA CTT GGT GGT GGC GCA GTG TCT ATA 
Ala Gly Gly lie Thr Leu Gly Ala Leu Gly Gly Gly Ala Val Sex lie 
1025 1030 1035 1040 



3120 



CCT TTT GCA ATA GCA GTT CAA GCCAGA CTT AAT TAT GTT GCT CTA CAA 
Pro Phe Ala lie Ala Val Gin Ala Arg Leu Asn Tyr Val Ala Leu Gin 

1Q45 1050 1055 



3168 



ACT GAT GTA TXG AGC AAG AAC CAG GAG ATC CTG GCT AAT GCT TTC AAT 
Thr Asp Val Leu Ser Lys Asn Gin Gin I la Leu Ala Asn Ala Phe Asn 

1060 1065 1070 



3216 



CAA GCT ATT GGT AAC ATT ACA CAG GCA TTT GGT AAG GTT AAT GAT GCT 

Gin Ala Xle Gly Asn He Thr Gin Ala Phe Gly Lys Val Asn Asp Ala 
1075 1080 1085 



3264 



ATA CAT CAA ACG TCA CAA GGT CTT GCT ACT GTT GCT AAA GCA TTG GCA 
He His Gin Thr Ser Gin Gly Leu Ala Thr Val Ala Lys Ala Leu Ala 
1090 1095 1100 



3312 



AAA GTG CAA GAT GTT GTT AAC ACA CAA GGG CAA GCT TTA AGC CAC CTA 

Lys val Gin Asp val Val Asn Thr Gin Gly Gin Ala Leu ser His Leu 
H05 1110 1115 1120 



3360 



ACA GTA CAA TTG CAA AAT AAT TTC CAA GCC ATT AGT AGT TCC ATT AGT 
Thr Val Gin Leu Gin Asn Asn Phe Gin Ala He Ser Ser Ser He Ser 

H30 1135 



3408 



GAC ATT TAT AAC AGG CTT GAT GAA TTG AGT GCT GAT GCA CAA GTT GAC 
Asp lie Tyr Asn Arg iLeu Asp Glu Leii Ser Ala Asp Ala Gin Val Asp 

1140 1145 ~ 1150 



3456 



AGG CTG ATT ACA GGA AGA CTT ACA GCA CTT AAT GCA TTT GTG TCT CAG 
Arg Leu He Thr Gly Arg Leu Thr Ala Leu Asn Ala Phe Val Ser Gin 
H55 1160 1165 



ACT TTA ACC AGA CAA GCA GAG GTT 
Thr Leu Thr Arg Gin Ala Glu Val 
1170 v 1175 



GCT AGC AGA CAG CTT GCT AAA 
Ala Ser Arg Gin Leu Ala Lys 
1180 



3504 



3552 



GAC AAG GTA AAT GAA TGC GTT 
Asp Lys Val Asn Glu Cys Val 

H90 



CAA TCT CAG AGA TTT GGA TTC 
Gin Ser Gin Arg Phe Gly Phe 
1195 " 1200 



3600 



TGT GGT AAT GGT ACA CAT TTA 
Cys Gly Asn Gly Thr His Leu 

1205 



CTT GGA AAT GCA GCA CCA AAT 

Leu Ala Asn Ala Ala Pro Asn 
1210 



3648 



GGC ATG ATC TTC TTT CAC ACA GTG 
Gly Met Xle Phe Phe His Thr Val 

1220 



Leu Pro Thr Ala Tyr Glu Thr 

1230 



3696 



GTG ACG GCC TGG TCA QGT ATT TGT GCA TCA GAT GGC GAT CGT ACT TTT 
Val Thr Ala Trp Ser Gly lie Cys Ala Ser A*p Gly Asp Arg Thr Phe 

1240 1245 



3744 
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GGA CTT GTT GTT AAG GAT GTC CAG TTG ACG CTG TTT CGC AAT CTA GAT 
Gly Leu Val Val Lys Asp Val Gin Leu Thr Leu Phe Arg Aon Leu Asp 
1250 12S5 1260 



3792 



GAC AAA TTC TAT TTG ACT CCC AGA ACT ATG TAT CAG CCT AGA GTT GCA 
Aso Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gin Pro Arg Val Ala 
1265 1270 " 1275 1280 



3840 



ACT ACT TCT GAT TTT GTT CAA ATT GAA GGA TGT GAT GTG TTG TTT GTT 
Thr S©r Ser Asp Phe Val Gin lie Glu Gly Cya Asp Val Leu Phe Val 

1285 1290 1295 

AAT GCA ACT GTA ATT GAC TTG CCT * AGT ATT ATA CCT GAC TAT ATT GAT 
Asn Ala Thr Val lie Asp Leu Pro Ser lie lie Pro Asp Tyr lie Asp 

1300 1305 1310 

ATT AAT CAA ACT GTT CAG GAC ATA TTA GAA AAT TTC AGA CCA AAT TGG* 
lie Asn Gin Thr Val Gin Asp lie Leu Glu Asn Phe Arg Pro Asn Trp 
1315 1320 1325 

ACT GTA CCT GAG TTG CCA CTT GAC ATT TTC AAT GCA ACC TAC TTA AAC 
Thr Val Pro Glu Leu Pro Leu Asp lie Phe Asn Ala Thr Tyr Leu Asn 
1330 1335 1340 



3888 



3936 



3984- 



4032 



CTG ACT GGT GAA ATT AAT GAC TTA GAA TTT AGG TCA GAA AAG TTA CAT 
Leu Thr Gly Glu lie Asn Asp Leu Glu Phe Arg Ser Glu Lys Leu His 
1345 1350 1355 1360 

AAC ACC ACA GTA GAA CTT GCT ATT CTC ATT GAT AAT ATT AAT AAC ACA 
Asn Thr Thr Val Glu Leu Ala lie Leu lie Asp Asn lie Asn Asn Thr 

1365 1370 1375 

TTA GTC AAT CTT GAA TGG CTC AAT AGA ATT GAA ACT TAT GTA AAA TGG 
Leu Val Asn Leu Glu Trp Leu Asn Arg lie Glu Thr Tyr Val Lys Trp 

1380 1385 1390 

CCT TGG TAT GTG TGG CTA CTA ATT GGA TTA GTA GTA ATA TTC TGC ATA 
Pro Trp Tyr Val Trp Leu Leu He Gly Leu Val Val lie Phe Cys He 
1395 1400 1405 

CCC ATA TTG CTA TTT TGT TGT TGT AGC ACT GGT TGT TGT GGA TGT ATT 

Pro He Leu Leu Phe Cys Cys Cys Ser Thr Gly Cys Cys Gly Cys He 
1410 1415 1420 

GGG TGT TTA GGA AGC TGT TGT CAT TCC ATA TGT AGT AGA AGG CGA TTT 
Gly Cys Leu Gly Ser Cys Cys His Ser He Cys Ser Arg Arg Arg Phe 
1425 1430 1435 1440 

GAA AGT TAT GAA CCA ATT GAA AAA GTG CAT GTC CAC TAA 
Glu Ser Tyr Glu Pro He Glu Lys Val His Val His 

1445 1450 



4080 



4128 



4176 



4224*' 



4272 



4320 



4359 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1452 amino 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOr2: 

Met He Val Leu Val Thr Cys Leu Leu Phe Ser Tyr Asn Ser Val Xle 
1 5 10 15 

Cys Thr Ser Asn Asn Asp Cye Val Gin Val Asn Val Thr Gin Leu Pro 

20 25 30 

Gly Asn Glu Asn He lie Lys Asp Phe Leu Phe His Thr Phe Lys Glu 
35 40 45 

Glu Gly Ser Val Val Val Gly Gly Tyr Tyr Pro Thr Glu Val Trp Tyr 
50 55 60 

Asn Cys Ser Arg Ser Ala Thr Thr Thr Ala Tyr Lys Asp Phe Ser Asn 
65 70 75 80 

lie His Ala Phe Tyr Phe Asp Met Glu Ala Met Glu Asn Ser Thr Gly 

85 90 95 

Asn Ala Arg Gly Lys Pro Leu Leu Val His Val His Gly Asp Pro Val 

100 105 110 

Ser He lie lie Tyr lie Ser Ala Tyr Arg Asp Asp Val Gin Gly Arg 
115 120 125 

Pro Leu Leu Lys His Gly Leu Leu Cys lie Thr Lys Asn Lys He He 
130 135 * 140 

Asp Tyr Asn Thr Phe Thr Ser Ala Gin Trp Ser Ala lie Cys Leu Gly 
145 150 155 160 

Asp Asp Arg Lys lie Pro Phe Ser Val lie Pro Thr Gly Asn Gly Thr 

165 170 175 

Lys He Phe Gly Leu Glu Trp Asn Asp Asp Tyr Val Thr Ala Tyr He 

180 185 190 

Ser Asp Arg Ser His His Leu Asn He Asn Asn Asn Trp Phe Asn Asn 
195 200 205 

Val Thr He Leu Tyr Ser Arg Ser Ser Thr Ala Thr Trp Gin Lys Ser 
210 215 220 

Ala Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys Leu 
225 230 235 240 

Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu Leu Cys Glu Asp Tyr Glu 

245 250 



cys cys Thr Gly Tyr Ala. Thr Asn Val Phe Ala Pro Thr Val Gly Gly 

260 265 270 

Tyr He Pro Asp Gly Phe Ser Phe Asa Asn Trp Phe Met Leu Thr Asn 
275 280 285 

Ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gin Pro Leu ieu 

290 295 300 



Val Asn Cys Leu Trp Pro Val Pro Ser Leu Gly Val Ala Ala Gin Glu 
305 310 315 320 
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Phe Cya Phe Glu Gly Ala Gin Phe Ser Gin Cys Asn Gly Val Ser Leu 

325 330 335 

Asn Asn Thr Val Asp Val He Arg Phe Asn Leu Asn Phe Thr Thr Asp 

340 345 350 

Val Gin Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly 
355 360 365 

Gly Val He Leu Glu He Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser 
370 375 380 

Ser Phe Tyr Ser Tyr Gly Glu He- Ser Phe Gly Val Thr Aep Gly Pro 
385 390 395 400 

Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lye Tyr Leu Gly 

405 410 415 

Thr Leu Pro Pro Ser Val Lys Glu He Ala He Ser Lye Trp Gly His 

420 425 430 

Phe Tyr He Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro lie Asp Cys 
435 440 445 

He Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr lie 
450 455 460 

Ala Tyr Thr Ser Tyr Thr Aep Ala Leu Val Gin Val Glu Asn Thr Ala 
465 470 475 480 

He Lys Lye Val Thr Tyr Cys Asn Ser His He Asn Asn He Lys Cys 

485 490 495 

Ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro Val Ala Ser 

500 505 510 

Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe 
515 520 525 

Tyr Ser His Thr Ser Val Asn He Thr He Asp Leu Gly Met Lys Arg 
530 535 540 

Ser Gly Tyr Gly Gin Pro lie Ala Ser Thr Leu Ser Asn He Thr Leu 
545 550 555 560 

Pro Met Gin Asp Asn Asn Thr Asp Val Tyr Cys He Arg Ser Asn Gin 

565 570 575 

Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp Asp Asp 

580 585 S90 

Val Phe Asn Ser Asp CyB Thr Asp Val Leu Tyr Ala Thr Ala Val He 
595 600 605 

Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn ABn Tyr Leu 
610 615 620 

Thr Phe Asn Lys Phe Cys Leu Ser Leu Asn Pro Val Gly Ala Asn Cys 
625 630 635 640 

Lys Phe Asp Val Ala Ala Arg Thr Arg Thr Asn Glu Gin Val Val Arg 

645 650 655 
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Ser Leu Tyr Val He Tyr Glu Glu Gly Asp Asn He Val Gly Val Pro 

660 665 670 



Ser Asp Asn Ser Gly Leu His Asp Leu Ser Val Leu His Leu Asp Ser 
675 680 685 

Cys Thr Asp Tyr Aan He Tyr Gly Arg Thr Gly Val Gly He He Arg 
690 695 700 



Gin Thr Asn Ser Thr Leu Leu Ser Gly Leu Tyr Tyr Thr Ser Leu Ser 
705 710 715 ~ 720 



Gly Asp Leu Leu Gly Phe Lys Asn 

725 



Ser Asp Gly Val He Tyr Ser 
730 735 



Val Thr Pro Cys Asp Val Ser Ala Gin Ala Ala Val He Asp Gly Ala 

740 745 750 

He Val Gly Ala Met Thr Ser Xle Asn Ser Glu Met Leu Gly Leu Thr 
755 760 765 



His Trp Thr Thr Thr Pro Asn Phe 
770 775 

Thr Asn Glu Arg Thr Arg Gly Thr 
785 790 

Cys Glu Pro He He Thr Tyr Ser 

805 

Ala Leu Val Phe He Asn Val Thr Hiff Ser Asp Gly Asp Val Gin Pro 

820 825 " 830 



Tyr Tyr Ser He Tyr Asn Tyr 
780 

lie Asp Ser Asn Asp Val Asp 
795 800 

Xle Gly Val Cys Lys Asn Gly 
810 



He Ser Thr Gly Asn Val Thr He Pro Thr Asn Phe Thr lie Ser Val 
835 840 



Gin Val Glu Tyr He Gin Val Tyr Thr Thr Pro Val Ser He Asp Cys 
850 855 860 . 



Ser Arg Tyr Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr 

870 r 875 880 



Gin Tyr Val Ser Ala Cys Gin Thr He Glu Gin Ala Leu Ala Met Gly 

890 895 



Ala Arg Leu Glu Asn Met Glu He Asp Ser Met Leu Phe Val Ser Glu 

900 905 910 

Asn Ala Leu Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu Thr 
915 920 925 

Leu Asp Pro He Tyr Lys Glu Trp Pro Asn He Gly Gly Ser Trp Leu 
530 935 940 



Gly Gly Leu Lys Asp He Leu Pro Ser 
945 950 



Asn Ser Lya 



Lys Tyr 
960 



Arg Ser Ala He Glu Asp Leu Leu. Phe Asp Lys Val Val Thr Ser Gly 

965 970 975 



Leu Gly Thr Val Asp Glu 

980 



Irys Arg Cys Thr Gly Gly Tyr Asp 
985 990 
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lie Ala Asp Leu Val Cys Ala Gin Tyr Tyr Asn Gly lie Met Val Leu 
995 1000 1005 

Pro Gly Val Ala Asn Asp Asp Lyo Met Ala Met *Tyr Thr Ala Ser Leu 
1010 1015 1020 

Ala Gly Gly lie Thr Leu Gly Ala Leu Gly Gly Gly Ala Val Ser lie 
1025 1030 1035 1040 

Pro Phe Ala lie Ala Val Gin Ala Arg Leu Asn Tyr Val Ala Leu Gin 

1045 1050 1055 

Thr Asp Val Leu Ser Lys Asn Gin Gin lie Leu Ala Asn Ala Phe Asn 

1060 1065 1070 

Gin Ala lie Gly Asn lie Thr Gin Ala Phe Gly Lys Val Aan Asp Ala 
1075 1080. 108& 

lie His Gin Thr Ser Gin Gly Leu Ala Thr Val Ala Lys Ala Leu Ala 
1090 1095 1100 

Lys Val Gin Asp Val Val Aon Thr Gin Gly Gin Ala Leu Ser His Leu 
1105 1110 1115 1120 

Thr Val Gin Leu Gin Asn Asn Phe Gin Ala He Ser Ser Ser lie Ser 

1125 1130 1135 

Asp He Tyr Asn Arg Leu Asp Glu Leu Ser Ala Asp Ala Gin Val Asp 

1140 1145 1150 

Arg Leu He Thr Gly Arg Leu Thr Ala Leu Asn Ala Phe Val Ser Gin 
1155 1160 1165 

Thr Leu Thr Arg Gin Ala Glu Val Arg Ala Ser Arg Gin Leu Ala Lys 
1170 1175 1180 

Asp Lys Val Asn Glu Cys Val Arg Ser Gin Ser Gin Arg Phe Gly Phe 
1185 1190 1195 1200 

Cys Gly Asn Gly Thr His Leu Phe Ser Leu Ala Asn Ala Ala Pro Asn 

1205 1210 1215 

Gly Met He Phe Phe His Thr Val Leu Leu Pro Thr Ala Tyr Glu Thr 

1220 1225 1230 

Val Thr Ala Trp Ser Gly He Cys Ala Ser Asp Gly Asp Arg Thr Phe 
1235 1240 1245 

Gly Leu Val Val Lys Asp Val Gin Leu Thr Leu Phe Arg Asn Leu Asp 
1250 1255 1260 

Asp Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gin Pro Arg Val Ala 
1265 1270 1275 1280 

Thr Ser Ser Asp Phe Val Gin He Glu Gly Cys Asp Val Leu Phe Val 

1285 1290 1295 

Asn Ala Thr Val He Asp Leu Pro Ser He He Pro Asp Tyr lie Asp 

1300 1305 1310 

He Asn Gin Thr Val Gin Asp He Leu Glu Asn Phe Arg Pro Asn Trp 
1315 1320 1325 
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Tiir Val Pro Glu Leu Pro Leu Asp Xle Phe Asn Ala Thr Tyr Leu Asn 
1330 1335 1340 

Leu Thr Gly Glu lie Asn Asp Leu Glu Phe Arg Ser Glu Lys Leu His 
1345 1350 1355 1360 

Asn Thr Thr Val Glu Leu Ala lie Leu lie Aep Asn lie Asn Asn Thr 

1365 1370 1375 

Leu Val Asn Leu Glu Trp Leu Asn Arg lie Glu Thr Tyr Val Lys Trp 

1380 1385 1390 

Pro Trp Tyr Val Trp Leu Leu lie Gly Leu Val Val lie Phe Cys lie 
1395 1400 1405 

Pro lie Leu Leu Phe Cys Cys Cys Ser Thr Gly Cys Cys Gly Cys lie 
1410 1415 1420 

Gly Cys Leu Gly ser cys Cys His Ser lie Cys Ser Arg Arg Arg Phe 
1425 1430 1435 " 1440 

Glu Ser Tyr Glu Pro lie Glu Lys Val His Val His 

1445 1450 



(2) INFORMATION FOR SEQ ID NO :3s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Gly Ser Val Val Val Gly Gly 
1 5 



Tyr Pro Thr Glu Val Trp Tyr Asa 
10 15. 



Cys Ser Arg Ser Ala Thr Thr Thr Ala Tyr Ly© Asp Phe Ser Asn He 

20 25 30 



His Ala Phe Tyr Phe Asp Met Glu 
35 ' 40 



Met Glu Aan Ser Thr Gly Asn 

45 



Ala Arg; Gly Lys Pro Leu Leu Val 
50 



Val His Gly Asp Pro Val Ser 
60 



He lie lie Tyr He Ser Ala Tyr 
65 70 



Asp Asp Val Gin Gly Arg Pro 

80 



Leu Leu Lys His Gly Leu Leu Cys 

85 

Tyr Asn Thr Phe Thr Set Ala Gin 

100 



Thr Lys Asn Lys ila He Asp 
90 g$ 



He Cys Leu Gly Asn 
110 



Arg Lys He Pro Phe Ser Val 
115 120 



He Pro Thr Gly Asn Gly Thr Lys 



Phe Gly Leu Glu Trp Asn Asp 
130 



Tyr Val Thr Ala Tyr He Ser 
140 
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Asp Arg Ser His His Leu Asn lie Asn Asn Asn Trp Phe Asn Asn Val 
145 150 155 160 

Thr lie Leu Tyr Ser Arg Ser Ser Thr Ala Thr Trp Gin Lys Ser Ala 

165 . 170 175 

Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys Leu Asn 

180 185 190 

Asn Thr Asn Gly Leu Lya Ser Tyr Glu 
195 200 

(2) INFORMATION FOR 5EQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser Ser Phe Tyr Ser Tyr Gly 
IS 10 15 

Glu He Ser Phe Gly Val Thr Asp Gly Pro Arg Tyr Cys Tyr Ala Leu 

20 25 30 

Tyr Asn Gly Thr Ala Leu Lya Tyr Leu Gly Thr Leu Pro Pro Ser Val 
35 40 45 

Lys Glu lie 
50 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr He Ala 
15 10 15 

Tyr Thr Ser Tyr Thr 

20 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SI amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Pro lie Ala Ser Thr Leu Ser Asn lie Thr Leu Pro Met Gin Asp Asn 
I 5 10 15 

Asn Thr Asp Val Tyr Cys lie Arg Ser Aen Gin Phe Ser Val Tyr Val 

20 25 3D 



His Ser Thr Cys I*ys Ser Ser Leu Trp Aep Asp Val Phe Asn Ser Asp 
35 40 45 

Cys Thr Asp 
50 



(2) INFORMATION FOR SEQ ID N0i7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Thr Asn Glu Gin Val Val Arg Ser Leu Tyr Val He Tyr- Glu Glu Gly 
1 5 lO 15 

Asp Asn He Val Gly Val Pro Ser Asp Asn Ser Gly Leu His Asp Leu 

20 25 30 



Ser Val Leu His Leu Asp 
35 

Thr Gly Val 
50 



Cys Thr Asp Tyr Asn He Tyr Gly Arg 
40 45 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8s 

Trp Thr Thr Thr Pro Asn Phe Tyr Tyr Tyr 
1 5 IP 

Asn Glu Arg Thr Arg Gly Thr Ala lie Asp Ser Asn Asp Val Asp Cys 

20 25 30 

Glu Pro lie lie Thr Tyr Ser Asn lie Gly Val Cys Lys Asn Gly Ala 
35 40 



Ser lie Tyr Asn Tyr Thr 



Leu Val Phe He Asn Vai Thr Ris Ser Asp Gly Asp Val Gin Pro He 
50 55 60 
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Ser Thr Gly Asn Val Thr He Pro Thr Asn Phe Thr He Ser Val Gin 
65 70 75 80 

Val 



(2) INFORMATION FOR SEQ ID HO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown * 

(ii) molecule TYPE j protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Glu Asn Met Glu lie Asp Ser Met Leu Phe Val Ser Glu Asn Ala Leu 
1 5 10 15 

Lys Leu Ala Ser Val Glu Ala Phe Aan Ser Thr Glu Thr Leu Asp Pro 

20 25 30 

He Tyr Lys Glu Trp Pro Asn He Gly Gly Ser Trp Leu Gly Gly Leu 
35 40 ~ ~ 45 

Lys Asp He Leu Pro Ser His Asn Ser Lys Arg Lys Tyr Arg Ser Ala 
50 55 * 60 

He Glu Asp Leu Leu Phe Asp Lys Val Val Thr Ser Gly Leu Gly Thr 
65 70 75 80 

Val Asp Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr Asp He Ala Aso 

85 90 95 

Leu Val Cys Ala Gin Tyr Tyr Asn Gly He Met Val Leu Pro Gly Val 

100 105 110 

Ala Asn Asp Asp Lys Met Ala Met Tyr Thr Ala Ser Leu Ala 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 

Gin Val Asp Arg Leu He Thr Gly Arg Leu Thr Ala Leu Asn Ala Phe 
15 10 15 

Val Ser Gin Thr Leu Thr Arg Gin Ala Glu Val Arg Ala Ser Arg Gin 

20 25 30 

Leu Ala Lys Asp Lys Val Asn Glu Cys Val Arg Ser Gin Ser Gin Arg 
35 40 45 
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Phe Gly Phe Cys Gly Asn Gly Thr His Leu Phe Ser Leu Ala Asn Ala 
50 55 60 

Ala Pro Asn Gly Met lie Phe Phe His Thr Val Leu 

70 75 



(2) INFORMATION FOR SEQ ID NOsllr 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown * 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION? SEQ ID NO: 11: 

Leu Val Val Lys Asp Val Gin Leu Thr Leu Phe Arg Asn Leu Asp Asp 
1 S 10 15 

Lys Phe Tyr Leu Thr Pro Arg Thr Met .Tyr Gin Pro Arg Val Ala Thr 

20 25 30 

Ser ser- Asp Phe Val Gin lie Glu Gly CyB Asp Val Leu Phe Val Asn 

40 45 



Ala Thr Val lie Asp Leu Pro Ser lie lie Pro Asp Tyr lie Asp lie 
50 55 go 

Asn Gin Thr Val Gin Asp lie Leu Glu Asn Phe Arg Pro Asn Trp Thr 
65 70 75 SO 

Val Pro Glu Leu Pro Leu Asp lie Phe Asn Ala Thr Tyr Leu Asn Leu 

85 90 95 

Thr Gly Glu lie Asn Asp. Leu Glu Phe Arg Ser Glu Lys Leu His Asn 

100 105 no 

Thr Thr Val Glu Leu Ala lie Leu lie Asp Asn lie Asn Asn Thr Leu 

120 125 



Val Asn Leu Glu Trp Leu Asn Arg lie Glu Thr Tyr Val Lys Tro Pro 
130 135 140 

Trp Tyr Val Trp Leu Leu Tie Gly Leu Val Val lie Phe Cys lie Pro 
"5 150 155 160 

Tie Leu Leu Phe Cys Cys Cys Ser Thr Gly Cys cys Gly Cys lie Gly 

165 170 i7 5 

Cys Leu Gly Ser Cys. Cys His Ser lie Cys Ser Arg Arg Arg Phe Glu 

180 185 190 

Ser Tyr Glu Pro He Glu Lys Val His Val His 
195 200 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) topology: unknown 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Phe Leu Phe His Thr Phe Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Trp Tyr Aan Cys Ser Arg Ser Ala Thr Thr Thr Ala Tyr Lys Asp Phe 
1 5 10 15 

Ser Asn lie 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Tyr Val Thr Ala Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu Leu CyB Glu Asp Tyr Glu 
1 5 10 15 
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Cys Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr VaX Gly Glv 

20 25 30 

Tyr lie 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ser Leu Asn Asn Thr Val Asp 
1 5 

(2) INFORMATION FOR SEQ ID NOrl7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE.; amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly val Thr Asp Gly Pro Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr 

5 -10 1S 

Ala Leu Lys Tyr Leu Gly Thr Leu Pro Pro Ser Val Lys Glu lie Ala 

20 25 30 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS: 

Ser Tyr Thr Asp Ala Leu VaX Gin Val Glu Asn Thr Ala lie Lys Lys 
1 5 10 15 

Val Thr Tyr Cys Asn Ser His lie Asn Asn He 

20 25 
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{2} INFORMATION FOR SEQ ID NO: 19: 

(i) sequence: CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

lie Ser Val Gin Val Glu Tyr lie Gin Val Tyr Thr Thr Pro Val 
1 5-io 15 

(2) INFORMATION FOR SEQ ID NO: 20: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Leu Ala Ser Val Glu Ala Phe Aen Ser Thr Glu Thr Leu Asp Pro 
1 5 10 15 

He Tyr Lys Glu Trp Pro Aen He Gly Gly Ser Trp Leu Gly Gly Leu 

20 25 30 

Lys Asp He Leu Pro 
35 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPEi protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Leu Gly Thr Val Asp Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr Asp 
1 5 10 * 15 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION'S SEQ ID NO: 22: 

Ala Asn Ala Phe Asn Gin Ala lie Gly Asn lie Thr Gin Ala Phe Glv 
1 5 ' 10 15 

Lys Val Aen Aep Ala He His Gin Thr Ser Gin Gly Leu Ala Thr Val 

20 25 30 

Ala Lys Ala Leu Ala Lys Val Gin Aep Val Val Asn Thr Gin Gly Gin 
35 40 45 

Ala Leu Ser His Leu Thr Val Gin Leu Gin Asn Asn Phe Gin Ala He 
50 55 - 60 

Ser Ser Ser He Ser Asp He Tyr Asn Arg Leu Asp Glu Leu 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(U) TOPOLOGY r unknown 



(ii) MOLECULE TYPEs protein 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 s 



Leu Ala Xle Leu He Asp Asn He Asn Asn Thr Leu Val Asn Leu Glu 
1 5 10 



15 

Trp Leu Asn Arg He Glu Thr Tyr Val Lys 

20 25 y 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: DNA (genomic) 

* * * * 

(ix) FEATURE: 

(A) NAME/KEY? CDS 

(B) LOCATION: 1..372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CAA GGG CAA GCT TTA AGC CAC CTA ACA GTA CAA TTG CAA AAT AAT TTC 
Girt Gly Gin Ala Leu Ser His Leu Thr Val Gin Leu Gin Asn Asn Phe 
1 5 10 * 15 



48 



CAA GCC ATT AGT AGT TCC ATT AGT GAC ATT TAT AAC AGG CTT GAT GAA 96 

Gin Ala He Ser Ser Ser lie Ser Asp He Tyr Asn Arg Leu Aep Glu 

20 '25 30 

TTG AGT GCT GAT GCA CAA GK GAC AGG CTG ATT ACA GGA AGA CTT ACA 144 
Leu Ser Ala Asp Ala Gin Val Aep Arg Leu He Thr Gly Arg Leu Thr 

c 40 45 
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GCA CTT AAT GCA TTT GTG TCT CAG ACT TTA ACC AGA CAA GCA GAG GTT 192 
Ala Leu Asn Ala Phe Val Ser Gin Thr Leu Thr Arg Gin Ala Glu Val 
50 55 60 

AGG GCT AGC AGA CAG CTT GCT AAA GAC AAG GTA AAT GAA TGC GTT AGG 240 
Arg Ala Ser Arg Gin Leu Ala Lya Asp Lys Val Asn Glu Cys Val Arg 
65 70 75 80 

TCT CAA TCT CAG AGA TTT GGA TTC TGT GGT AAT GGT ACA CAT TTA TTT 288 
Ser Gin Ser Gin Arg Phe Gly Phe Cys Gly Aon Gly Thr His Lau Phe 

85 90 95 

TCA CTT GCA AAT GCA GCA CCA AAT GGC ATG ATC TTC TTT CAC ACA GTG 336 
Ser Leu Ala Asn Ala Ala Pro Asn Gly Met He Phe Phe His Thr Val 

100 105 110 

CTA TTA CCA ACA GCT TAT GAA ACC GTG ACG GCC TGG 372 
Leu Leu Pro Thr Ala Tyr Glu Thr Val Thr Ala Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE T¥PE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Gin Gly Gin Ala Leu Ser His Leu Thr Val Gin Leu Gin Aen Asn Phe 
IS 10 15 

Gin Ala lie Ser Ser Ser lie Ser Asp He Tyr Asn Arg Leu Asp Glu 

20 25 30 

Leu Ser Ala Asp Ala Gin Val Asp Arg Leu He Thr Gly Arg Leu Thr 
35 40 45 

Ala Leu Asn Ala Phe Val Ser Gin Thr Leu Thr Arg Gin Ala Glu Val 
50 55 60 

Arg Ala Ser Arg Gin Leu Ala Lys Asp Lys Val Asn Glu Cys Val Arg 
65 70 75 80 

Ser Gin Ser Gin Arg Phe Gly Phe Cys Gly Asn Gly Thr His Leu Phe 

85 90 95 

Ser Leu Ala Asn Ala Ala Pro Asn Gly Met He Phe Phe His Thr Val 

100 105 110 

Leu Leu Pro Thr Ala Tyr Glu Thr Val Thr Ala Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 26: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double" 

(D) TOPOLOGY: unknown 



50OCID: <WO_0323423A1J_> 



WO 93/23423 



PCT/US93/04692 



54 

(ii) MOLECOT.E TYPE j DNA (genomic) 

(ix) FEATURE : 

(A) NAME/KEY" r CDS 

(B) LOCATION; 1..180 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CTT GGT ATG AAG CGT AGT GGT TAT GGT CAA CCC ATA GCC TCA ACA TTA 48 
Leu Gly Met Lys Arg Ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu 
15 10 15 

AGT AAC ATC ACA CTA CCA ATG C AG -GAT AAT AAC ACC GAT GTG TAC TGC 96 
Ser Asn lie Thr Leu Pro Met Gin Asp Asn Asn Thr Asp val Tvr Cvs 

20 25 30 

ATT CGT TCT AAC CAA TTT TCA GTT TAC GTT CAT TCC ACT TGT AAA AGT 144 
lie Arg Ser Asn Gin Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser 
35 40 45 

TCT TTA TGG GAC GAT GTG TTT AAT TCC GAC TGC ACA 180 
Ser Leu Trp Asp Asp Val Phe Asn Ser Asp Cys Thr 
50 55 60 

(2) INFORMATION FOR SEQ ID NO:27: 

(1) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 60 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Leu Gly Met Lys Arg Ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu 
1 5 lO 



Ser Asn He Thr Leu Pro Met Gin Aap Asn Aen Thr Asp Val Tyr Cys 

20 25 30 

lie Arg Ser Asn Gin Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser 

40 45 



Ser Leu Trp Asp Asp Val Phe Asn Ser Asp Cys Thr 
50 55 * 60 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: doufcle 

(D) TOPOLOGY i unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEYS CDS 
<B) LOCATIONS X.,141 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GTC ATT AGA TTC AAC CTT AAT TTT ACC ACA GAT GTA CAA TCT GGT ATG 48 
Val lie Arg Phe Asn Leu Agn Phe Thr Thr Asp Val Gin Ser Gly Met 
1 5 10 15 

GGT GCT ACA GTA TTT TCA CTG AAT ACA ACA GGT GGT GTC ATT CTT GAG 96 
Gly Ala Thr Val Phe Ser Leu Aon Thr Thr Gly Gly Val He Leu Glu 

20 25 30 

ATT TCT TGT TAT AAT GAT ACA GTG AGT GAG TCA AGT TTC TAC AGT 141 
He Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser Ser Phe Tyr Ser 
35 40 ' 45 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 47 amino acids 
{ B ) TYPE: amino acid 
(D) TOPOLOGY x linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Val lie Arg Phe Asn Leu Asn Phe Thr Thr Asp Val Gin Ser Gly Met 
15 10 15 

Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly Gly Val lie Leu Glu 

20 25 30 

lie Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser Ser Phe Tyr Ser 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..51 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TGT ATA ACT AAA AAT AAA ATC ATT GAC TAT AAC ACG TTT ACC AGC GCA 48 

Cys He Thr Lys Asn Lys He He Asp Tyr Asn Thr Phe Thr Ser Ala 
15 10 15 



CAG 
Gin 



51 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 17 amino acids 
(B) TYPE: amino acid 
(D> TOPOLOGY: Linear 

(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Cys He Thr Lya Aan Lya He He Asp Tyr Aan Thr Phe Thr Ser Ala 
i 5 -10 15 

Gin 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) topology : unknown 

(ii) MOLECULE TYPE? DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION t 1..42 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TCT TOT TAT AAT GAT ACA GTG AGT GAG TCA ACT TTC TAC AGT 42 
Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser Ser Phe Tvr Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ser Cys Tyr Asn Asp Thr Val Ser Glu Ser Ser Phe Tyr Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic} 
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(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .51 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

ATT GGG TGT TTA GGA AGC TGT TGT CAT TCC ATA TGT AGT AGA AGG CGA 48 
lie Gly Cys Leu Gly Ser Cya Cya His Ser lie Cya Ser Arg Arg Arg 
15 10 15 

TTT 5 2 

Phe 

« 

{2} INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE? TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 : 

lie Gly Cya Leu Gly Ser Cys Cys His Ser lie Cys Ser Arg Arg Arg 
15 10 15 

* 

Phe 

(2) INFORMATION FOR SEQ ID NO:36t 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 baBe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..42 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TGC ATA CCC ATA TTG CTA TTT TGT TGT TGT AGC ACT GGT TGT 42 
Cys lie Pro lie Leu Leu Phe Cys Cys Cys Ser Thr Gly Cya 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 37 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Cys lie Pro lie Leu Leu Phe Cys 

1 5 



Cys ser Thr Gly Cys 
10 



(2) INFORMATION FOR SEQ ID NO = 38: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: unknown • 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1^195 

(xi) SEQUENCE DESCRIPTION s SEQ ID NO: 38s 
TAC TTA AAC CTG ACT GOT GAA ATT AAT GAC TTA GAA TTT AGG TCA GAA 

Tyr Leu Asn Leu Thr Gly Glu lie Asn Asp Leu Glu Phe Arg Ser Glu 
1 5 10 15 

AAG TTA CAT AAC ACC ACA GTA GAA CTT GCT ATT CTC ATT GAT AAT ATT 
Lys Leu His Asn Thr Thr Val Glu Leu Ala lie Leu He Asp Asn He 

20 25 30 

AAT AGA ATT GAA ACT TAT 
Asn Arg lie Glu Thr Tyr 
45 

ATT GGA TTA GTA GTA ATA 
He Gly Lea Val Val lie 
60 



AAT AAC ACA TTA GTC AAT CTT GAA TGG 
Asn Asn Thr Leu Val Asn Leu Glu Trp 

40 



GTA AAA TGG CCT TGG TAT GTG TGG CTA 
Val Lys Trp Pro Trp Tyr Val Trp Leu 
50 ^ 55 



48 



96 



144 



192 



TTC 
Phe 
65 



195 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS 
{A} LENGTH: 65 
(B) TYPE: amino 
(D) TOPOLOGY : 



(ii) MOLECULE TYPE: protein. 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:39 



Tyr Leu Asm Leu Thr Gly Glu lie 
l 5 



Asp Leu Glu Phe Arg Ser Glu 
10 



Lys Leu His Asn Thr Thr Val Glu Leu Ala He 

20 25 



Leu He Asp Asn lie 
30 



Asn Asn Thr Leu Val Asn Leu Glu Trp Leu Aen Arg He Glu Thr Tyj 
35 40 45 
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Val LyB Trp Pro Trp Tyr Val Trp Leu Leu lie Gly Leu Val Val lie 
50 55 60 

Phe 
65 

(2) INFORMATION FOR SEQ ID NO: 40: 
<i) SEQUENCE CHARACTERISTICS z 

(A) LENGTH: 765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix ) FEATURE : 

<A) NAME /KEY: CDS 
(B) LOCATION: 1. ,765 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GAT GGA CCG CGT TAG TGT TAC GCA CTC TAT AAT GGC ACG GCT CTT AAG 48- 
Asp Gly Pro Arg Tyr Cye Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lys 
1 5 10 15 

TAT TTA GGA ACA TTA CCA CCT AGT GTC AAG GAA ATT GCT ATT AGT AAG 96 

Tyr Leu Gly Thr Leu Pro Pro Ser Val LyB Glu lie Ala lie Ser Lys 

20 25 30 

TGG GGC CAT TTT TAT ATT AAT GGT TAC AAT TTC TTT AGC ACT TTT CCT 144 
Trp Gly His Phe Tyr lie Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro 
35 40 45 

ATT GAT TGT ATA TCT TTT AAT TTA ACC ACT GGT GAT AGT GGA GCA TTT 192 

lie Asp Cys He Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe 
50 55 60 

TGG ACA ATT GCT TAC ACA TCG TAC ACT GAC GCA TTA GTA CAA GTT GAA 240 
Trp Thr He Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu 
65 70 75 80 

AAC ACA GCT ATT AAA AAG GTG ACG TAT TGT AAC AGT CAC ATT AAT AAC 288 
Asn Thr Ala lie Lys Lys Val Thr Tyr Cys Asn Ser His He Asn Asn 

85 90 95 

ATT AAA TGT TCT CAA CTT ACT GCT AAT TTG CAA AAT GGA TTT TAT CCT 336 
He Lys Cys Ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro 

100 105 110 

GTT GCT TCA AGT GAA GTT GGT CTT GTC AAT AAG AGT GTT GTG TTA CTA 384 

Val Ala ser ser Glu Val Gly Leu Val Asn Lys ser Val val Leu Leu 
115 120 125 

CCT AGT TTC TAT TCA CAT ACC AGT GTT AAT ATA ACT ATT GAT CTT GGT 432 
Pro Ser Phe Tyr Ser His Thr Ser Val Asn He Thr He Asp Leu Gly 
130 135 140 

ATG AAG CGT AGT GGT TAT GGT CAA CCC ATA GCC TCA ACA TTA AGT AAC 480 
Met Lys Arg Ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu Ser Asn 
145 150 * 155 160 
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ATC ACA CTA CCA ATG CAG GAT AAT AAC ACC GAT GTG TAC TGC ATT CGT 528 
lie Thr Leu Pro Met Gin Asp Asn Aan Thr Asp Val Tyr CyB lie Arg 

165 170 175 

TCT AAC CAA TTT TCA GTT TAC GTT CAT TCC ACT TGT AAA AGT TCT TTA 576 
Ser Asn Gin Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu 

180 185 190 

TGG GAC GAT GTG TTT AAT TCC GAC TGC ACA GAT GTT TTA TAT GCT ACA 624 
Trp Asp Asp Val Phe Asn Ser Asp Cys Thr Aso Val Leu Tyr Ala Thr 
195 200 ~ 205 

GCT GTT ATA AAA ACT GGT ACT TGT CCT TTC TCG TTT GAT AAA TTG AAC 672 
Ala Val lie Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn 
210 215 220 

AAT TAC TTA ACT TTT AAC AAG TTC TGT TTG TCA TTG AAT CCT GTT GGT 720 
Asn Tyr Leu Thr Phe Asn Lys Phe Cys Leu Ser Leu Asn Pro Val Gly 
225 230 235 240 

GCC AAC TGC AAG TTT GAT GTT GCC GCT CGT ACA AGA ACC AAT GAG 765 
Ala Asn Cys Lys Phe Asp Val Ala Ala Arg Thr Arg Thr Asn Glu 

245 250 

(2) INFORMATION FOR SEQ ID NO 5 41 2 

(i) SEQUENCE CHARACTERISTICS z 

(A) LENGTH? 255 amino acids 

(B) TYPEs amino acid: 
(D> topologts linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION r SEQ ID NO: 41s 

Asp Gly Pro Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lys 
1 5 10 15 



Tyr Leu Gly Thr Leu Pro Pro Ser Val Lys Glu lie Ala lie Ser Lvs 

20 25 30 

Trp Gly His Phe Tyr lie Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro 
35 40 45 

He Asp Cys lie Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe 
50 55 60 

Trp Thr He Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu 
65 70 75 80 

Asn Thr Ala lie Lys Lys Val Thr Tyr Cys Asn Ser His lie Asn Asn 

90 95 



He Lys Cys ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe T^r Fro 

100 105 no 

Val Ala Ser Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu 

120 125 



Pro Ser Phe Tyr Ser His Thr Ser Val: Asn He Thr He Asp Leu Gly 
130 135 140 
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Met Lys Arg Ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu Ser Asn 
145 150 155 160 

He Thr Leu Pro Wet Gin Asp Asn Asn Thr Asp Val Tyr Cys He Arg 

155 170 175 

Ser Asn Gin Phe Ser Val Tyr Val His Ser Thr CyB Lye Ser Ser Leu 

180 18S 190 

Trp Asp Asp Val Phe Asn Ser Asp Cys Thr Asp Val Leu Tyr Ala Thr 
195 200 205 

Ala Val lie Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn 
210 215 220 

Asn Tyr Leu Thr Phe Asn Lys Phe Cys Leu Ser Leu Asn Pro Val Gly 
225 230 235 24D 

Ala Asn cys Lys Phe Asp Val Ala Ala Arg Thr Arg Thr Asn Glu 

245 250 255 

(2) INFORMATION FOR SEQ ID NO: 42 s 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH s 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs double 
<D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE; DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

( B ) LOCATION: 1..1284 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

AGG CCT CTT TTA AAA CAT GGT TTG TTG TGT ATA ACT AAA AAT AAA ATC 48 
Arg Pro Leu Leu Lys His Gly Leu Leu Cys lie Thr Lys Asn Lys lie 
15 10 15 

ATT GAC TAT AAC ACG TTT ACC AGC GCA CAG TGG AGT GCC ATA TGT TTG 96 
lie Asp Tyr Asn Thr Phe Thr Ser Ala Gin Trp Ser Ala lie Cys Leu 

20 25 30 

GGT GAT GAC AGA AAA ATA CCA TTC TCT GTC ATA CCC ACA GGT AAT GGT 144 
Gly Asp Asp Arg Lys lie Pro Phe Ser Val lie Pro Thr Gly Asn Gly 
35 40 45 

ACA AAA ATA TTT GGT CTT GAG TGG AAT GAT GAC TAT GTT ACA GCC TAT 192 
Thr Lys He Phe Gly Leu Glu Trp Asn Asp Asp Tyr Val Thr Ala Tvr 
50 55 60 

ATT AGT GAT CGT TCT CAC CAT TTG AAC ATC AAT AAT AAT TGG TTT AAC 240 
He Ser Asp Arg Ser His His Leu Asn lie Asn Asn Asn Trp Phe Asn 
65 70 75 80 

AAT GTG ACA ATC CTA TAC TCT CGA TCA AGC ACT GCT ACG TGG CAG AAG 288 
Asn Val Thr lie Leu Tyr Ser Arg Ser Ser Thr Ala Thr Trp Gin Lys 

30 95 
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AGT GCT GCA TAT GTT TAT CAA GOT GTT TCA AAT TTT ACT TAT TAC AAG 336 
Ser Ala Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys 

100 105 110 

TTA AAT AAC ACC AAT GGC TTG AAA AGC TAT GAA TTG TGT GAA GAT TAT 384 
Leu Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu Leu Cys Glu Asp Tyr 
115 120 125 

GAA TGC TGC ACT GGC TAT GCT ACC AAC GTA TTT GCC CCG ACA GTG GGC 432 
Glu Cys Cys Tar Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Val Gly 
130 135 140 

GGT TAT ATA CCT GAT GGC TTC AGT TTT AAC AAT TGG TTT ATG CTT ACA 480 
Gly Tyr He Pro Asp Gly Phe Ser Phe Asn Asn Trp Phe Met Leu Thr 
145 150 155 * 160 

AAC AGT TCC ACG TTT GTT AGT GGC AGA TTT GTA ACA AAT CAA CCA TTA 528 
Asn ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gin Pro Leu 

165 170 175 

TTG GTT AAT TGT TTG TGG CCA GTG CCC AGT CTT GGT GTC GCA GCA CAA 576 
Leu Val Asn Cys Leu Trp Pro Val Pro Ser Leu Gly Val Ala Ala Gin 

180 185 190 

GAA TTT TGT TTT GAA GGT GCG CAG TTT AGC CAA TGT AAT GGT GTG TCT 624 
Glu Phe Cys Phe Glu Gly Ala Gin Phe Ser Gin Cys Asn Gly Val Ser 
195 200 205 

TTA AAC AAT ACA GTG GAT GTC ATT AGA TTC AAC CTT AAT TTT ACC ACA 672 
Leu Asn Asn Thr Val Asp Val He Arg Phe Asn Leu Asn Phe Thr Thr 
210 215 220 

GAT GTA CAA TCT GGT ATG GGT GCT ACA GTA TTT TCA CTG AAT ACA ACA 720 
Asp Val Gin Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr 
225 230 235 240 

GGT GGT GTC ATT CTT GAG ATT TCT TGT TAT AAT GAT ACA GTG AGT GAG 768 
Gly Gly Val lie Leu Glu lie Ser Cys Tyr Asn Asp Thr Val Ser Glu 

245 250 



TCA AGT TTC TAC AGT TAT GGT GAA ATT TCA TTC GGC GTA ACT GAT GGA 816 
Ser ser Phe Tyr Ser Tyr Gly Glu He Ser Phe Gly Val Thr Asp Gly 

260 265 270 

CCG CGT TAC TGT TAC GCA CTC TAT AAT GGC ACG GCT CTT AAG TAT TTA 864 
Pro Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu 
275 28D 285 

GGA ACA TTA CCA CCT AGT GTC AAG GAA ATT GCT ATT AGT AAG TGG GGC 912 
Gly Thr Leu Pro Pro Ser Val Lye Glu He Ala lie Ser Lys Trp Gly 
290 295 300 

CAT TTT TAT ATT AAT GGT TAC AAT TTC TTT -AGC ACT TTT CGT ATT GAT 960 
His Phe Tyr Xle Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro lie Asp 
305 310 315 320 

TGT ATA TCT TTT AAT TT$l ACC ACT GGT GAT AGT GGA GCA TTT TGG ACA 1008 

Cys Xle Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr 

330 335 



ATT GCT TAC ACA TC<5 TAC ACT GAC GCA TTA GTA CAA GTT GAA AAC ACA 1056 
Zle Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Qlu Asn Thr 

340 S45 350 
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GCT ATT AAA AAG GTG ACG TAT TGT AAC AGT CAC ATT AAT AAC ATT AAA 1104 

Ala lie Lys Lya Val Thr Tyr Cys Aan Ser His lie Asn Asn lie Lys 
355 360 365 

TGT TCT CAA CTT ACT GCT AAT TTG CAA AAT GGA TTT TAT CCT GTT GCT 1152 
Cys Ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro Val Ala 
370 375 380 

TCA AGT GAA GTT GGT CTT GTC AAT AAG AGT GTT GTG TTA CTA CCT AGT 1200 
Ser Ser Glu Val Gly Leu Val Aen Lye Ser Val Val Leu Leu Pro Ser 
385 390 395 400 

TTC TAT TCA CAT ACC AGT GTT AAT ATA ACT ATT GAT CTT GGT ATG AAG 1248 
Phe Tyr Ser His Thr Ser Val Asn lie Thr lie Asp Leu Gly Met Lys 

405 410 415 

CGT AGT GGT TAT GGT CAA CCC ATA GCC TCA ACA TTA 1284 
Arg Ser Gly Tyr Gly Gin Pro lie Ala Ser Thr Leu 

420 425 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 : 

Arg Pro Leu Leu Lys His Gly Leu Leu Cys lie Thr Lys Asn Lys lie 
IS 10 15 

lie Asp Tyr Asn Thr Phe Thr Ser Ala Gin Trp Ser Ala lie Cys Leu 

20 25 30 

Gly Asp Asp Arg Lys lie Pro Phe Ser Val lie Pro Thr Gly Asn Gly 
35 40 45 

Thr Lys lie Phe Gly Leu Glu Trp Asn Asp Asp Tyr Val Thr Ala Tyr 
50 55 ** 60 

He Ser Asp Arg Ser His His Leu Asn He Asn Asn Asn Trp Phe Asn 
65 70 75 80 

Asn Val Thr He Leu Tyr Ser Arg Ser Ser Thr Ala Thr Trp Gin Lys 

85 90 ~ 95 

Ser Ala Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys 

100 105 110 

Leu Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu Leu Cys Glu Asp Tyr 
115 120 125 

Glu Cys Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Val Gly 
130 135 140 

Gly Tyr He Pro Asp Gly Phe Ser Phe Asn Asn Trp Phe Met Leu Thr 
145 150 155 " 160 
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Asn Ser Ser Thr Phe val Ser Gly Arg Phe Val Thr Asn Gin Pro Leu 

165 170 175 

Leu Val Asn Cya Leu Trp Pro Val Pro Ser Leu Gly Val Ala Ala Gin 

180 185 190 

Glu Phe Cys Phe Glu Gly Ala Gin Phe Ser Gin cys Asn Gly Val Ser 
195 200 205 

Leu Asn Asn Thr val Asp Val Xle Arg Phe Asn Leu Asn Phe Thr Thr 
210 215 220 

Asp. Val Gin Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr 
225 230 235 240 

Gly Gly Val He Leu Glu He Ser Cys Tyr Asn Asp Thr Val Ser Glu 

245 250 255 

Ser Ser Phe Tyr Ser Tyr Gly Glu lie Ser Phe Gly Val Thr Asp Gly 

260 265 270 

Pro Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu 
275 280 285 

Gly Thr Leu Pro Pro Ser Val Lys Glu He Ala He Ser Lys Trp Gly 
290 295 300 

His Phe Tyr He Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro He Asp 
305 310 315 320 

Cys lie Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr 

325 330 



He Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu Asn Thr 

340 345 350 

Ala He Lys Lys Val Thr Tyr Cys Asn Ser His lie Asn Asn lie Lys 
355 360 365 

Cys Ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro Val Ala 
370 375 380 

Ser Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser 
385 390 395 400 

Phe Tyr Ser His Thr Ser Val Asn lie Thr He Asp Leu Gly Met Lys 

405 410 415 

Arg Ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu 

420 425 

(2) INFORMATION FOR SEQ ID NOt44; 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS • double 

(D) TOPOLOGY: unknown 



(JJ.) MOLECULE TYPES DNA (genomic) 
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( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..546 

(3d.) SEQUENCE DESCRIPTION: SEQ ID NO: 44 : 

GAT TGT ATA TCT TTT AAT TTA ACC ACT GGT GAT AGT GGA GCA TTT TGG 48 
Asp Cys II© Ser Phe Aon Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp 
15 10 15 

ACA ATT GCT TAC ACA TCG TAC ACT GAC GCA TTA GTA CAA GTT GAA AAC 96 
Thr He Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu Asn 

20 . 25 30 

ACA GCT ATT AAA AAG GTG ACG TAT TGT AAC AGT CAC ATT AAT AAC ATT 144 
Thr Ala He Lys Lye Val Thr Tyr Cys Asn Ser His He Asn Asn He 
35 40 45 

AAA TGT TCT CAA CTT ACT GCT AAT TTG CAA AAT GGA TTT TAT CCT GTT 192 
Lys Cys Ser Gin Leu Thr Ala Aen Leu Gin Asn Gly Phe Tyr Pro Val 
50 55 60 

GCT TCA AGT GAA GTT GGT CTT GTC AAT AAG AGT GTT GTG TTA CTA CCT 240 
Ala Ser Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro 
65 70 75 80 

AGT TTC TAT TCA CAT ACC AGT GTT AAT ATA ACT ATT GAT CTT GGT ATG 2 88 

Ser Phe Tyr Ser His Thr Ser Val Asn He Thr He Asp Leu Gly Met 

85 90 95 

AAG CGT AGT GGT TAT GGT CAA CCC ATA GCC TCA ACA TTA AGT AAC ATC 336 
Lys Arg Ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu Ser Asn He 

100 105 HO 

ACA CTA CCA ATG CAG GAT AAT AAC ACC GAT GTG TAC TGC ATT CGT TCT 384 

Thr Leu Pro Met Gin Asp Asn Aan Thr Asp Val Tyr Cys He Arg Ser 
115 120 125 

AAC CAA TTT TCA GTT TAC GTT CAT TCC ACT TGT AAA AGT TCT TTA TGG 432 
Asn Gin Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp 
130 135 ~ 140 

GAC GAT GTG TTT AAT TCC GAC TGC ACA GAT GTT TTA TAT GCT ACA GCT 480 
Asp Asp Val Phe Asn Ser Asp Cys Thr Asp Val Leu Tyr Ala Thr Ala 
145 150 155 160 

GTT ATA AAA ACT GGT ACT TGT CCT TTC TCG TTT GAT AAA TTG AAC AAT 528 
Val He Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn 

165 170 175 

TAC TTA ACT TTT AAC AAG 545 

Tyr Leu Thr Phe Asn Lys 

180 



(2) INFORMATION FOR SEQ ID NO: 45s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Asp Cys He Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp 
1 5 10 X5 

Thr lie Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu Asn 

20 25 30 

Thr Ala He Lys Lys Val Thr Tyr Cys Asn ser His lie Asn Asn He 
35 40 45 

Lys Cys Ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro Val 
50 55 60 

Ala ser ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro 
65 70 75 80 

Ser Phe Tyr Ser His Thr Ser Val Asn He Thr He Asp Leu Gly Met 

85 90 95 

Lys Arg ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu Ser Asn He 

100 105 110 

Thr Leu Pro Met Gin Asp Asn Asn Thr Asp Val Tyr Cys He Arg Ser 
115 120 125 

Asn Gin Phe Ser Val Tyr Val His Ser Thr Cys Lye Ser Ser Leu Trp~ 
130 135 140 

Asp Asp Val Phe Asn Ser Asp Cys Thr Asp Val Leu Tyr Ala Thr Ala 
145 150 155 160 

Val He Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn 

165 170 175 

Tyr Leu Thr Phe Asn IryB 

180 

(2) INFORMATION FOR SEQ ZD NCH46: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH* 38 base pairs 

(B) TYPE: nucleic acid 

(C) strandeone SS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION! SEQ ID NO: 46: 
TAAATAGGCC TTTAGTGGAC ATGCACTTTT TCAATTGG 38 

(2) INFORMATION FOR SEQ ID Nfr:47: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TTAGTAGGCC TGTCGAGGCT ATGGGTTGAC CATAACCAC 39 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown • 

(ii) MOLECULE TYPE: DNA (genomic) 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CAGATCCCGG GTGTACAATC TGGTATGGGT GCTACAG 37 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTGCCCCCGG GTATGATTGT GCTCGTAACT TGCCTCTTG 39 



(2) INFORMATION FOR SEQ ID NO: 50; 

{ i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AGCACCCATA CCAGATTGTA CATCTGCAGT GAAATTAAGA TTG 43 



(2) INFORMATION FOR SEQ ID NO: 51: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51s 

Met lie Val Leu Val Thr Cys Leu Leu Phe Ser Tyr Asn Ser Val lie 
1 5 10 15 

Cys Thr Ser Asn Asn Asp Cys Val Gin Val Asn Val Thr Gin Leu Pro 

20 25 30 

Gly Asn Glu Asn Xle lie Lys Asp Phe Leu Phe His Thr Phe Lys Glu 
35 40 45 

Glu Gly ser Val Val Val Gly Gly Tyr Tyr Pro Thr Glu Val Trp Tyr 
50 55 - 60 

Asn Cys Ser Arg Ser Ala Thr Thr Thr Ala Tyr Lys Asp Phe Ser Asn 
65 70 75 80 

lie His Ala Phe Tyr Phe Asp Met Glu Ala Met Glu Asn Ser Thr Gly 

85 9Q 95 

Asn Ala Arg Gly Lys Pro Leu Leu Val His Val His Gly Asp Pro Val 

100 105 110 

ser lie He lie Tyr lie Ser Ala Tyr Arg Asp Asp Val Gin Gly Arg 
115 120 125 

(2) INFORMATION" FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 amino acids 

(B) TYPEr amino acid 
(D) TOPOLOGY r unknown 

(iiy MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Asp Val Gin Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr 
1 5 10 15 

Gly Gly Val lie Leu Glu lie Ser Cys Tyr Asn Asp Thr Val Ser Glu 

20 25 30 

Ser Ser Phe Tyr Ser Tyr Gly Glu He Ser Phe Gly Val Thr Asp Glv 
35 40 45 

Pro Arg Tyr Cys Tyr Ala Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu 
50 55 60 

Gly Thr Leu Pro Pro Ser Val Lys Glu He Ala lie ser Lys Trp Gly 
65 70 75 so 



His Phe Tyr He Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro He Asp 

85 90 95 

Cys He Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr 

100 105 HQ 

He Ala Tyr Thr Ser Tyr Thr Asp Ala Leu Val Gin Val Glu Asn Thr 

120 125 
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Ala lis Lys Lys Val Thr Tyr Cys Asn Ser His lie Asn Asn lie Lys 
130 135 140 

Cys Ser Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro Val Ala 
145 150 155 160 

Ser Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser 

165 170 175 

Phe Tyr Ser His Thr Ser Val Asn lie Thr lie Asp Leu Gly Met Lys 

180 185 190 

Arg Ser Gly Tyr Gly Gin Pro lie Ala Ser Thr Leu Ser Asn lie Thr 
195 200 205 

Leu Pro Met Gin Asp Asn Asn Thr Asp Val Tyr Cys lie Arg Ser Asn 
210 215 220 

Gin Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp Asp 
225 230 235 240 

Asp Val Phe Asn Ser Asp CyB Thr Asp Val Leu Tyr Ala Thr Ala Val 

245 250 255 

lie Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr 

260 265 270 

Leu Thr Phe Asn Lys Phe Cys Leu Ser Leu Asn Pro Val Gly Ala Asn 
275 280 285 

Cys Lys Phe Asp Val Ala Ala Arg Thr Arg Thr Asn Glu Gin Val Val 
290 295 300 

Arg Ser Leu Tyr Val lie Tyr Glu Glu Gly Asp Asn He Val Gly Val 
305 310 315 320 

Pro Ser Asp Asn Ser Gly Leu His Asp Leu Ser Val Leu His Leu Asp 

325 330 335 

Ser Cys Thr Asp Tyr Asn lie Tyr Gly Arg Thr Gly Val Gly He He 

340 345 350 

Arg Gin Thr Asn Ser Thr Leu Leu Ser Gly Leu Tyr Tyr Thr Ser Leu 
355 360 365 

Ser Gly Asp Leu Leu Gly Phe Lys Asn Val Ser Asp Gly Val lie Tyr 
370 375 380 

Ser Val Thr Pro Cys Asp Val Ser Ala Gin Ala Ala Val lie Asp Gly 
385 390 395 400 

Ala He Val Gly Ala Met Thr Ser He Asn Ser Glu Met Leu Gly Leu 

405 410 415 

Thr His Trp Thr Thr Thr Pro Asn Phe Tyr Tyr Tyr Ser He Tyr Asn 

420 425 430 

Tyr Thr Asn Glu Arg Thr Arg Gly Thr Ala He Asp ser Asn Asp Val 
435 440 445 

Asp Cys Glu Pro He He Thr Tyr Ser Asn He Gly Val Cys Iats Asn 
450 455 460 
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Gly Ala Leu Val Phe lie Asn Val Thr His Ser Asp Gly Asp Val Gin 
46S 470 475 ~ ~ ~ 480 

Pro He Ser Thr Gly Asn Val Thr He Pro Thr Asn Phe Thr He Ser 

485 490 495 

Val Gin Val Glu Tyr lie Gin Val Tyr Thr Thr Pro Val Ser He Asp 

S00 505 510 • 

Cys Ser Arg Tyr Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu 

520 525 



Thr Gin Tyr Val Ser Ala Cys Gin Thr He Glu Gin Ala Leu Ala Met 
530 535 540 

Gly Ala Arg Leu Glu Asn Met Glu He Asp Ser Met Leu Phe Val Ser 
545 550 555 560 

Glu Asn Ala Leu Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu 

565 570 575 



Thr Leu Asp Pro He Tyr Lys Glu Trp Pro Asn He Gly Gly Ser Trp 

580 585 590 



Leu Gly Gly Leu Lys Asp lie Leu Pro Ser His Asn Ser Lys Arg Lys 
S95 600 605 

Tyr Arg Ser Ala He Glu Asp Leu Leu Phe Asp Lys Val Val Thr Ser 
610 615 620 

Gly Leu Gly Thr Val Asp Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr 
625 "0 ~* 635 * * 640 



Asp He Ala Asp Leu Val Cys Ala Gin Tyr Tyr Asn Gly He Met Val 

545 650 * 655 

Leu Pro Gly Val Ala Asn Asp Asp Lys Met Ala Met Tyr Thr Ala Ser 

660 665 670 

Leu Ala Gly Gly lie Thr Leu GXy Ala Leu Gly Gly Gly Ala Val Ser 
675 68G 



He Pro Phe Ala He Ala Val Gin Ala Arg Leu Asn Tyr Val Ala Leu 
690 695 700 

Gin Thr Asp Val Leu Ser Lye Asn Gin Gin He Leu Ala Asn Ala Phe 
705 710 715 720 

Asn Gin Ala He Gly Asn He Thr Gin Ala Phe Gly Lys Val Asn Asp 

725 730 735 

Ala He His Gin Thr Ser Gin Gly Leu Ala Thr Val Ala Lys Ala Leu 

740 745 750 

Ala Lye Val Gin Asp Val Val Asn Thr Gin Gly Gin Ala Leu Ser His 
755 760 765 

Leu Thr Val Gin Leu Gin Asn Asn Phe Gin Ala He Ser Ser Ser lie 
770 775 780 

ser Asp He Tyr Asn Arg Leu Asp Glu Leu Ser Ala Asp Ala Gin Val 
785 790 795 8Q0 
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Asp Arg Leu He Thr Gly Arg Leu Thr Ala Leu Asn Ala Phe Val Ser 

80S 810 815 

Gin Thr Leu Thr Arg Gin Ala Glu Val Arg Ala Ser Arg Gin Leu Ala 

820 825 830 

Lys Asp Lys Val Aen Glu Cys Val Arg Ser Gin Ser Gin Arg Phe Gly 
835 840 845 

Phe Cys Gly Asn Gly Thr Hie Leu Phe Ser Leu Ala Asn Ala Ala Pro 
850 855 860 

Asn Gly Met He Phe Phe His Thr Val Leu Leu Pro Thr Ala Tyr Glu 
865 870 875 * 880 



Thr Val Thr Ala Trp Ser Gly He Cys Ala Ser Asp Gly Asp Arg Thr 

885 890 ~ 895 

Phe Gly Leu Val Val Lys Asp Val Gin Leu Thr Leu Phe Arg Asn Leu 

900 90S 910 

Asp Asp Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gin Pro Arg Val 
915 920 925 

Ala Thr Ser Ser Asp Phe Val Gin He Glu Gly Cys Asp Val Leu Phe 
930 935 940 

Val Asn Ala Thr Val He Asp Leu Pro Ser He He Pro Asp Tyr He 
945 950 955 960 

Asp He' Asn Gin Thr Val Gin Asp He Leu Glu Asn Phe Arg Pro Asn 

965 970 975 

Trp Thr Val Pro Glu Leu Pro Leu Asp He Phe Asn Ala Thr Tyr Leu 

980 985 990 

Asn Leu Thr Gly Glu He Asn Asp Leu Glu Phe Arg Ser Glu Lys Leu 
995 1000 1005 . 

His Asn Thr Thr Val Glu Leu Ala He Leu He Asp Asn He Asn Asn 
1010 1015 1020 

Thr Leu Val Asn Leu Glu Trp Leu Asn Arg He Glu Thr Tyr Val Lys 
1025 1030 1035 "* 1040 

Trp Pro Trp Tyr Val Trp Leu Leu He Gly Leu Val Val He Phe Cys 

1045 1050 1055 

He Pro He Leu Leu Phe Cys Cys Cys Ser Thr Gly Cys Cys Glv Cvs 

1060 1065 1070 

He Gly Cys Leu Gly Ser Cys Cys His Ser He Cys Ser Arg Aro Aro 
1075 1080 1085 

Phe Glu Ser Tyr Glu Pro He Glu Lys Val His Val His 
1090 1095 1100 

(2) INFORMATION FOR SEQ ID NO: 53: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TXPEs protein 

(xi) SEQDENCR DESCRIPTION": SKQ ID NO: 53: 

Met lie Val Leu Val Thr Cya Leu Leu Phe Ser Tyir Asn Ser Vai lie 
1 5 10 



15 



Cys Thr Ser Asn Asn Asp Cya Val Gin Val Asn Val Thr Gin Leu Pro 



20 



30 



Gly Asn Glu Asn lie He Lys Asp Phe Leu Phe His Thr Phe Lys Glu 
35 40 45 

0 

Glu Gly Ser Val Val Val Gly Gly Tyr Tyr Pro Thr Glu Val Trp Tyr 
50 55 60 

Asn Cys Ser Arg Ser Ala Thr Thr Thr Ala Tyr Lys Asp Phe Ser Asn 
65 70 75 80 

lie His Ala Phe Tyr Phe Asp Met Glu Ala Met Glu Asn Ser Thr Glv 

90 95 



Asn Ala Arg Gly Lys Pro Leu Leu Val His Val His Gly Asp Pro Val 

100 105 no 

Ser He lie lie Tyr He Ser Ala Tyr Arg Asp Asp Val Gin Gly Arg 
115 120 125 

Pro Leu Leu Lys His Gly Leu Leu Cys He Thr Lys Asn Lys He He 

135 140 



Asp Tyr Asn Thr Phe Thr Ser Ala Gin Trp Ser 
145 is© 



Ala He Cys Leu Gly 

160 

Asp Asp Arg Lys He Pro Phe Ser Val He Pro Thr Gly Asn Gly Thr 

1SS 170 i7s 

Lys He Phe Gly Leu Glu Trp Asn Asp Asp Tyr Val Thr Ala Tyr He 

180 185 190 



Ser Asp Arg Ser 
195 

Val Thr He Leu 
210 



His Leu Asn 
200 



Asn Trp Phe Asn Asn 
205 

Thr Trp Gin Lys Ser 
220 



Ala Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys Leu 
225 230 235 240 

Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu Leu Cys Glu Asp Tyr Glu 

255 



Cys Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Val Gly Glv 

260 265 270 



Tyr He 



Pro Asp Gly 
275 



Ser Phe. Asn Asn Trp Phe Met Leu Thr Asn 
280 2S5 



Ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gin Pro Leu Leu 
290 295 3O0 

val Asn Cys Leu Trp Pro Val Pro Ser Leu Gly val Ala Ala Gin Glu 
305 315 320 
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Phe Cys Phe Glu Gly Ala Gin Phe Ser Gin Cys Asn Gly Val Ser Leu 

325 330 335 

Asn Asn Thr Val Asp Val He Arg Phe Asn Leu Asn Phe Thr Thr Asp 

340 345 350 

Val Gin Ser Gly Met Gly Ala Thr Val Phe 
355 360 

(2) INFORMATION FOR SEQ 10 NO: 54; 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH z 1101 amino acids 

(B) TYPE: amino acid 

{ D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Ala Ala Tyr Val Tyr Gin Gly Val Ser Asn Phe Thr Tyr Tyr Lys Leu 
1 5 10 .15 

Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu Leu CyB Glu Asp Tyr Glu 

20 25 30 

Cys Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Val Gly Gly 
35 40 45 

Tyr lie Pro Asp Gly Phe Ser Phe Asn Asn Trp Phe Met Leu Thr Asn 
50 55 60 

Ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gin Pro Leu Leu 
65 70 75 8.0 

Val Asn Cys Leu Trp Pro Val Pro Ser Leu Gly Val Ala Ala Gin Glu 

85 90 95 

Phe Cys Phe Glu Gly Ala Gin Phe Ser Gin Cys Asn Gly Val Ser Leu 

100 105 110 

Asn Asn Thr Val Asp Val lie Arg Phe Asn Leu Asn Phe Thr Thr Asp 
115 120 125 

Val Gin Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly 
130 135 140 

Gly Val lie Leu Glu He Ser Cys Tyr Asn Asp Thr Val Ser Glu Ser 
145 150 155 160 

Ser Phe Tyr Ser Tyr Gly Glu He Ser Phe Gly Val Thr Asp Gly Pro 

165 170 175 

Arg Tyr Cys Tyr Ala Leu Tyr Ash Gly Thr Ala Leu Lys Tyr Leu Gly 

180 185 190 

Thr Leu Pro Pro Ser Val Lys Glu He Ala He Ser Lys Trp Gly His 
195 200 205 

Phe Tyr He Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro He Asp Cys 
210 215 220 
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lie Ser Phe Asn Leu Thr 
225 230 

Ala Tyr Thr Ser Tyr Thr 

245 

Xle Lys Lys Val Thr Tyr 

260 

Ser Gin Leu Thr Ala Asn 
275 



Gly Asp Ser Gly Ala Phe Trp Thr lie 

235 " 240 

Ala Leu Val Gin Val Glu Asn Thr Ala 
250 



Aen Ser His lie Asn Asn He Lys Cys 

270 



Gin Asn Gly Phe Tyr Pro Val Ala Ser 
280 



Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe 
290 295 300 

Tyr Ser His Thr Ser Val Asn He Thr He Asp Leu Gly Met Lys Arg 



310 



315 



320 



ser Gly Tyr Gly Gin Pro He Ala Ser Thr Leu Ser Asn He Thr Leu 

325 330 335 

Pro Met Gin Asp Asn Asn Thr Asp Val Tyr Cys He Arg Ser Asn Gin 

340 345 350 



Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp Asp Asp 



Val Phe Asn Ser Asp Cys Thr Asp Val Leu Tyr Ala Thr Ala Val He 



380 



Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu 



390 



400 



Thr Phe Asa Lys Phe Cys L&u Ser Leu Asn Pro Val Gly Ala Asn Cys 



405 410 415 

Lys Phe Asp Val Ala Ala Arg Thr Arg thr Asn Glu Gin Val Val Arg 

A ~" r 430 



Ser Leu Tyr Val lie Tyr Glu Glu Gly Asp Asn He Val Gly Val Pro 



435 440 445 

Ser Asp Asn Ser Gly Leu His Asp Leu Ser Val Leu His Leu Asp Ser 

4 60 



450 

Cys Thr Asp Tyr Asn lie Tyr Gly Arg Thr Gly Val Gly He rie Arq 
465 470 47 S 48 g 

Gin Thr Asn Ser Thr Leu Leu Ser Gly Leu Tyr Tyr Thr Ser Leu Ser 

485 49 0 495 

Gly Asp Leu Leu Gly Phe Lys Asn Val Ser Asp Gly Val He Tyr- Ser 

500 505 5io 

Val Thr Pro cys Asp Val Ser Ala Gin Ala Ala Val He Asp Gly Ala 
515 520 525 

He val Gly Ala Met Thr Ser He Asn Ser Glu Met Leu Gly Leu Thr 
530 . 535 540 

His Trp Thr Thr Thr Pro Asn Phe Tyr Tyr Tyr Ser lie Tyr Asn Tyr 

555 5£Q 
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Thr Asn Glu Arg Thr Arg Gly Thr Ala lie Asp Ser Asn Asp Val Asp 

565 570 575 

Cys Glu Pro He lie Thr Tyr Ser Aen He Gly Val Cys Lys Asn Gly 

580 585 - 5 90 

Ala Leu Val Phe He Asn Val Thr His Ser Asp Gly Asp Val Gin Pro 
595 600 605 

He Ser Thr Gly ABn Val Thr He Pro Thr Asn Phe Thr He Ser Val 
610. 615 620 

Gin Val Glu Tyr lie Gin Val Tyr Thr Thr Pro Val Ser He Asp Cys 
625 630 635 640 

Ser Arg Tyr Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr 

645 650 655 

Gin Tyr Val Ser Ala Cys Gin Thr lie Glu Gin Ala Leu Ala Met Gly 

660 665 670 

Ala Arg Leu Glu Asn Met Glu lie Asp Ser Mot Leu Phe Val Ser Glu 
675 6S0 685 

Asn Ala Leu Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu Thr 
690 695 700 

Leu Asp Pro He Tyr Lys Glu Trp Pro Asn He Gly Gly Ser Trp Leu 
705 710 715 720 

Gly Gly Leu Lys Asp lie Leu Pro Ser His Asn Ser Lys Arg Lys Tyr 

725 730 735 

Arg Ser Ala He Glu Asp Leu Leu Phe Asp Lys Val Val Thr Ser Gly 

740 745 750 

Leu Gly Thr Val 'Asp Glu Asp Tyr Lys Arg Cys Thr Glv Gly Tyr Asp 
755 760 765 

* 

He Ala Asp Le* Val Cys Ala Gin Tyr Tyr Asn Gly He Met Val Leu 
770 775 780 

Pro Gly Val Ala Asn Asp Asp Lys Met Ala Met Tyr Thr Ala Ser Leu 
785 790 795 800 

Ala Gly Gly He Thr Leu Gly Ala Leu Gly Gly Gly Ala Val Ser He 

805 810 815 

Pro Phe Ala He Ala Val Gin Ala Arg Leu Asn Tyr Val Ala Leu Gin 

S20 825 830 

Thr Asp Val Leu Ser Lys Asn Gin Gin He Leu Ala Asn Ala Phe Asn 
835 840 845 

Gin Ala He Gly Asn He Thr.qin Ala Phe Gly Lys Val Asn Asp Ala 
850 855 860 

He His Gin Thr Ser Gin Gly Leu Ala Thr Val Ala Lys Ala Leu Ala 
865 870 875 ** 080 

Lys Val Gin Asp Val Val Asn Thr Gin Gly Gin Ala Leu Ser His Leu 

885 890 
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Thr Val Gin Leu Gin Asn Asn Phe Gin Ala He Ser Ser Ser He Ser 

900 905 gio 

Asp He Tyr Asn Arg Leu Asp Glu Leu Ser Ala Asp Ala Gin Val Aso 
915 920 



Arg Leu He Thr Gly Arg Leu Thr Ala Leu Asn Ala Phe Val Ser Gin 
930 935 940 

Thr Leu Thr Arg Gin Ala Glu Val Arg Ala Ser Arg Gin Leu Ala Lvs 
945 950 955 960 

Asp Lys Val Asn Glu cys Val- Arg Ser Gin Ser Gin. Arg Phe Gly Phe 

965 970 975 

Cys Gly Asn Gly Thr His Leu Phe Ser Leu Ala Asn Ala Ala Pro Asn 

980 985 990 

Gly Met He Phe Phe His Thr Val Leu Leu Pro Thr Ala Tyr Glu Thr 
995 1000 1005 

Val Se ^ «Y Ile Cya Ala Ser Asp Gly Asp Arg Thr Phe 

1°10 1015 1020 

Gly Leu Val Val Lys Asp Val Gin Leu Thr Leu Phe Arg Asn Leu Asp 
1025 1°30 1035 1040 



Asp Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gin Pro Arg val Ala 

1Q 45 1050 1055 

Thr Ser Ser Asp Phe Val Gin He Glu Gly Cys Asp Val Leu Phe Val 

1060 1065 1070 

Asn Ala Thr Val He Asp Leu Pro Ser He lie Pro Asp Tyr He Asp 
1075 1080 1085 

He Asn Gin Thr Val Gin Asp He Leu Glu Asn Phe Aro 
1090 1095 1100 

(2) INFORMATION FOR SEQ ID NO? 55s 

(i) SEQUENCE CHARACTERISTICS? 

(A) LENGTH: 701 base pairs 

(B) TYPEs nucleic acid 
{C) STRANDEDNESS s double 
(D) TOPOLOGY s unknown 

(ii) MOLECULE TYPEs DNA (genomic) 

(3cl> SEQUENCE DESCRIPTIONS SEQ ID NOtSSs 

TCAACCATTA TTGGTTAATT GTTTGTGGCC AGTGCCCAGT CTTGGTGTCG CAGCACAAGA SO 

ATTTTGTTTT GAAGGTGCGC AGTTTAGCCA ATGTAATGGT GTGTCTTTAA ACAATACAGT 120 

GGATGTCATT AGATTCAACC TTAATTTTAC CACAGATGTA CAATCTGGTA TGGGTGCTAC 180 

AGTATTTTCA CIGAATACAA CAGGTGGTGT CATTCTTOAG ATTTCTTGTT ATSATGATAC 240 

AGTGAGTGAG TCAAGTTTCT AC&GTTATGG TGAAATtTGA TTCGGCGTAA CTGATGGACC 300 

GCGTTACTGT TACGCACTCT ATAATGGCAC GGCTCTTAAG TATTTAGGAA CATTACCACC 360 



9 
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TAGTGTCAAG GAAATTGCTA TTAGTAAGTG GGGCCATTTT TATATTAATG GTTACAATTT 



420 



CTTTAGCACT TTTCCTATTG ATTGTATATC TTTTAATTTA ACCACTGGTG ATAGTGGAGC 



480 



ATTTTGGACA ATTGCTTACA CATCGTAGAC TGACGCATTA GTACAAG TTG AAAACACAGC 



540 



TATTAAAAAG GTGACGTATT GTAACAGTCA CATTAATAAC ATTAAATGTT CTCAACTTAC 



600 



TGCTAATTTG CAAAATGGAT TTTATCCTGT TGCTTCAAGT GAAGTTGGTC TTGTCAATAA 



660 



GAGTGTTGTG TTACTACCTA GTTTCTATTC ACATACCAGT G 



701 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



AGCACCGGTA ATGTCACGAT 


ACCTACAAAT 


TTTACCATAT 


CTGTGCAAGT 


TGAGTACATT 


60 


CAGGTTTACA 


CTACACCGGT 


GTCAATAGAT 


TGTTCAAGGT 


ACGTTTGCAA 


TGGTAACCCT 


120 


AGATGCAATA 


AATTGTTAAC 


GCAATACGTT 


TCTGCATGTC 


AAACTATTGA 


GCAAGCACTT 


180 


GCAATGGGTG 


CCAGACTTGA 


AAACATGGAG 


ATTGATTCCA 


TGTTGTTTGT 


TTCGGAAAAT 


240 


GCCCTTAAAT 


TGGCATCTGT 


TGAAGCATTC 


AATAGTACGG 


AAACTTTAGA 


TCCTATTTAC 


300 


AAAGAATGGC 


CTAACATTGG 




CTAGGAGGTT TAAAAGACAT ATTGCCATCT 


360 


CACAACAGCA 


AACGTAAGTA 


CCGGTCGGCT 


ATAGAAGATT 


TGCTTTTTGA 


TAAGGTTGTA 


420 


ACATCTGGCT 


TAGGTACAGT 


TGATGAAGAT 


TATAAACGTT 


GTACAGGTGG 


TTATGACATA 

• 


480 


GCTGACTTAG 


TGTGTGCACA 


ATATTACAAT 


GGCATCATGG 


TGCTACCTGG 


TGTAGCTAAT 


540 


GATGACAAGA 


TGGCTATGTA 


CACTGCATCT 


CTTGCAGGTG GTATAACATT AGGTGCACTT 


600 


GGTGGTGGCG 


CAGTGTCTAT 


ACCTTTTGCA 


ATAGCAGTTC 


AACCCAGACT 


TAATTATGTT 


660 


GCTCTACAAA 


CTGATGTATT 


GAGCAAGAAC 


CAGCAGATCC 


TGGCTAATGC 


TTTCAATCAA 


720 


GCTATTGGTA 


ACATTACACA 


GGCATTTGGT 


AAGGTTAATG 


ATGCTATACA 


TCAAACGTCA 


780 


CAAGGTCTTG 


CTACTGTTGC 


TAAAGCATTG 


GCAAAAGTGC 


AAGATGTTGT 


TAACACACAA 


840 


GGGCAAGCTT 


TAAGCCACCT 


AACAGTACAA 


TTGCAAAATA 


ATTTCCAAGC 


CATTAGTAGT 


900 


TCCATTAGTG 


ACATTTATAA 


CAGGCTTGAT 


GAATTGAGTG 


CTGATG CACA 


AGTTGACAGG 


960 


CTGATTACAG 


GAAGACTTAC 


AG C ACTTAAT 


GCATTTGTGT 


CTCAGACTTT 


AACCAGACAA 


1020 


GCAGAGGTTA 


GGGCTAGCAG 


ACAGCTTGCT 


AAAGACAAGG 


TAAATGAATG 


CGTTAGGTCT 


1080 


CAATCTCAGA 


GATTTGGATT 


CTGTGGTAAT 


GGTACACATT 


T ATTT TCACT 


TGCAAATGCA 


1140 
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GCACCAAATG GCATGATCTT CTTTCACACA GTGCTATTAC CAACAGCTTA TGAAACCGTG 1200 

ACGGCCTGGT CAGGTATTTG TGCATCAGAT GGCGATCGTA CTTTTGGACT TGTTGTTAAG 1260 

GATGTCCAGT TGACGCTGTT TCGCAATCTA GATGACAAAT TCTATTTGAC TCCCAGAACT 1320 

ATGTATCAGC CTAGAGTTGC AACTAGTTCT GATXTTGTTC AAATTGAAGG ATGTGATGTG 1380 



TTGTTTGTTA ATGCAACTGT A 



1401 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE s amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTXON: SEQ ID NO: 57: 

Met He Val Leu Val Thr Cys Leu Leu Phe 
15 10 

Cys Thr Ser Asn Asn Asp Cys Val Gin Val 

20 



Ser Tyr 



Ser Val He 
15 



Gly Asn Glu Asn He He Lys Asp Phe Leu 

40 



Glu Gly Ser Val Val Val Gly Gly Tyr Tyr 
50 



Val Thr Gin Leu Pro 
30 

His Thr Phe Lys Glu 
45 

Thr Glu Val Trp Tyr 
60 

Asn Cys ser Arg Ser Ala Thr Thr Thr Ala Tyr Lys Asp Phe Ser Asn 
65 70 75 80 

He His Ala Phe Tyr Phe Asp Met Glu Ala Met Olu Asn Ser Thr Gly 

85 90 95 

Asn Ala Arg Gly Lys Pro Leu Leu Val His Val His Gly Asp Pro Val 

100 105 no 



Ser He He He Tyr lie Ser Ala Tyr Arg: Asp Asp Val Gin Gly Arg 
115 120 

Pro Leu Leu Lys His Gly 
130 

Asp Tyr Asn Thr Phe Thr 
145 ISO 



Cys He Thr Lys Asn Lys He He 

140 

Gin Trp Ser; Ala He Cys Leu Gly 

160 



Asp Asp Arg Lys Xle Pro 

165 

Lys He Phe Gly Leu Glu 

180 



Val lie Pro Thr Gly Asn Gly Thr 
170 175 

Asp Asp Tyr Val Thr Aia !cyr lie 
185 190 



Val Thr He Leu: 
210 



Asn He Asn Asn Asn Trp Phe Asn Asn 
200 205 

Ser Ser Thr Ala Thr Trp Gin Lys Ser 

220 
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Ala Ala Tyr Val Tyr Gin Gly Val Ser Aan Phe Thr Tyr Tyr Lys Leu 
225 230 235 240 

Asn Asn Thr Asn Gly Leu Lys Ser Tyr Glu 

245 250 

(2) INFORMATION FOR SEQ ID NO: 58s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Ser Phe Asn Leu Thr Thr Gly Asp Ser Gly Ala Phe Trp Thr lie Ala 
15 10 15 

Tyr Thr Ser Tyr. Thr Asp Ala Leu Val Gin Val Glu Asn Thr Ala He 

20 25 30 

Lys Lys Val Thr Tyr Cys Asn Ser His He Asn Asn He Lys Cys Ser 
35 40 45 

Gin Leu Thr Ala Asn Leu Gin Asn Gly Phe Tyr Pro Val Ala Ser Ser 
50 55 60 

Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe Tyr 
65 70 75 ■ 80 

Ser Hie Thr Ser Val Asn He Thr He Asp Leu Gly Met Lys Arg Ser 

85 90 95 

Gly Tyr Gly Gin Pro He Ala Ser Thr Leu Ser Asn He Thr Leu Pro 

100 105 HO 

Met Gin Asp Asn Asn Thr Asp Val Tyr Cys He Arg Ser Asn Gin Phe 
115 120 125 

Ser Val Tyr Val His Ser Thr Cys LyB Ser Ser Leu Trp Asp Asp Val 
130 135 140 

Phe Asn Ser Asp Cys Thr Asp Val Leu .Tyr Ala Thr Ala Val He Lys 
14 5 150 155 160 

Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu Thr 

165 170 175 

Phe Asn Lys Phe Cys Leu Ser Leu Asn Pro Val Gly Ala Asn Cys Lys 

180 185 190 

Phe Asp Val Ala Ala Arg Thr Arg Thr 
195 200 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TXPE: protein 

(xi) SEQUENCE DESCRIPTION* SEQ ID NO: 59s 

Glu Asn Met Glu He Asp Ser Met Leu Phe Val Ser Glu Asn Ala Leu 
IS 10 15 

Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu Thr Leu Asp Pro 

20 25 30 

He Tyr Lys Glu Trp Pro Asn lie Gly Gly Ser Trp Leu Gly Gly Leu 
35 40 45 

Lye Asp He Leu Pro Ser His Asn Ser Lys Arg Lye Tyr Arg Ser Ala 
50 55 60 

He Glu Asp Leu Leu Phe Asp Ly b Val Val Thr Ser Gly Leu Gly Thr 
65 70 75 " 80 

Val Asp Glu Asp Tyr Lys Arg Cye Thr Gly Gly Tyr Asp He Ala Asp 

90 95 



Leu Val Cys Ala Gin Tyr Tyr Asn Gly lie Ket Val Leu Pro Gly Val 

100 105 110 

Ala Asn Asp Asp Lys Met Ala Met Tyr Thr Ala Ser Leu Ala Gly Gly 

120 125 



He Thr Leu Gly Ala Leu Gly Gly Gly Ala Val Ser He Pro Phe Ala 
130 135 140 

He Ala Val Gin Ala Arg Leu Asn Tyr Val Ala Leu Gin Thr Asp Val 
145 150 ISS 160 

Leu Ser Lys Asn Gin Gin He Leu Ala Asn Ala Phe Asn Gin Ma He 

170 175 



Gly Aen He Thr Gin Ala Phe Gly Lys Val Asn Asp Ala He His Gin 

ISO 185 190 

Thr Ser Gin Gly Leu Ala Thr Val Ala Lys Ala Leu Ala Lys Val Gin 

200 205 



Asp Val Val Asn Thr Gin Gly Gin Ala Leu Ser His Leu Thr Val Gin 
210 215 220 

Leu Gin Asn Asn Phe Gin Ala He Ser Ser Ser He Ser Asp He Tyr 
225 230 235 240 

Asn Arg Leu Asp Glu Leu Ser Ala Asp Ala Gin 

245 250 
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What is claimed is: 



1. An isolated protein sequence comprising a 
selected sequence from the s protein of a canine 
coronavirus strain, optionally fused to a second selected 
fusion protein. 

2. The protein according to claim 1 wherein 
said strain is CCV 1-71. 

3. The protein according to claim 1 comprising 
amino acid residues 1 to 1452 SEQ ID NO: 2. 

4. The protein according to claim 1 wherein 
sequence is selected from the group 

consisting of: 1113-1236 SEQ ID N0:25, 540-599 SEQ ID 
N0:27, 342-388 SEQ ID N0:29, 137-153 SEQ ID NO:31, 375-388 
SEQ ID NO: 33, 1424-1440 SEQ ID NO: 35, 1407-1420 SEQ ID 
N0:37, 1342-1406 SEQ ID NO:39, 398-652 SEQ ID NO:44 r 128- 
555 SEQ ID NO:43, and 447-628 SEQ ID N0:45. 

5. An isolated DNA sequence comprising a 
selected nucleotide sequence from the S gene of a canine 
coronavirus strain, optionally associated with the 
nucleotide sequence encoding a fusion protein. 

6. The DNA sequence according to claim 5 
wherein said selected sequence comprises nucleotides 1 to 

4356 SEQ ID NO: 1. 

7. The DNA sequence according to claim 5 wherein 
the selected sequence is a nucleotide sequence selected 
from the group consisting of: 3337-3708 SEQ id NO: 24, I6is- 

1797 SEQ ID NO: 26, 1024-1164 SEQ ID NO: 28, 409-459 SEQ ID 
NO: 30, 1123-1164 SEQ ID NO:32, 4270-4320 SEQ ID NO:34, 
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4219-4260 SEQ ID NO: 34 , 4024-4218 SEQ ID NO: 38, 1192^1956 
SEQ ID NO:40, 382-1665 SEQ ID NO:42, and 1339-1884 SEQ ID 
NO:44. 

8 . A method for the production of a recombinant 
CCV protein comprising culturing a selected host 
transformed with a DMA sequence encoding a selected CCV s 
protein or fragment thereof in operative association with 
regulatory sequences capable of regulating the expression 
of said protein. 

9. The method according to claim 8 wherein said 
host is a mammalian cell. 



1Q . The method according to claim 8 wherein said 
host is a viral vector. 

11. A recombinant DNA molecule comprising a DNA 
sequence coding for a selected portion of a canine 
coronavirus s protein*, said DNA sequences in operative 
association with regulatory sequences capable of directing 
the expression thereof in host cells, 

12. A vaccine composition comprising an 
effective amount of a canine coronavirus protein comprising 
a selected canine coronavirus strain S protein , or 
immunogenic fragment thereof and an optional carrier. 

13. A composition according to claim 12 wherein 

said strain is CCV 1-71. * 

14. The composition according to claim 12 * 
wherein said S protein is a fusion protein. 
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15. The vaccine composition according to claim 
12 further comprising an immunogenic amount of one or more 
additional antigens. 

16. A method for vaccinating an animal against 
CCV gastroenteritis which comprises the step of internally 
administering to the animal an effective amount of a CCV s 
protein, S fusion protein or an immunogenic fragment 
thereof. 

17. An antibody to a protein comprising a 
selected sequence from the S gene of a canine coronavirus 
strain, said antibody being specific for a CCV S gene 
epitope. 

18. The protein according to claim 17 wherein 
said strain is CCV 1-71 » 

19. A diagnostic reagent comprising a selected 
sequence from the S protein of a canine coronavirus strain , 
optionally fused to a second selected fusion protein, said 
sequence optionally associated with a detectable label. 

20. A diagnostic reagent comprising an antibody 
to a protein comprising a selected sequence from the S gene 
of a canine coronavirus strain , said antibody being 
specific for a CCV S gene epitope and said' antibody 
optionally associated with a detectable label. 

21. A diagnostic reagent which comprises a 
nucleotide sequence encoding or flanking a CCV s protein or 
fragment, said nucleotide sequence optionally associated 
with a detectable label. 
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22. A diagnostic kit comprising one or more 
diagnostic reagents selected from the group consisting of 

(a) a selected sequence from the S protein 
of a canine coronavirus strain , optionally fused to a 
second selected fusion protein, said sequence optionally 
associated with a detectable label; 

(b) an antibody to a protein comprising a 
selected sequence from the S gene of a canine coronavirus 
strain f said antibody being specific for a CCV S gene 
epitope and said antibody optionally associated with a 
detectable label; and 

(c) a nucleotide sequence encoding or 
flanking a CCV S protein or fragment r said nucleotide 
sequence optionally associated with a detectable label. 

23- A method of diagnosing CCV infection in dogs 
comprising treating a tissue sample from a dog with a 
diagnostic reagent of claim 22. 

24* The method according to claim 23 wherein 
dogs previously exposed to CCV or to a CCV vaccine arfe 
detected . 

25. The method according to claim 23 wherein 
said diagnostic method can differentiate exposure to CCV 
from exposure to another related coronavirus. 

26. The method according to claim 23 wherein 
said diagnostic method can .differentiate exposure to 
different strains of CCV. ^ 

27 „ The method according to claim 23 wherein $ 
said method can identify dogs at advanced stages of CCV 
infection. 
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